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Sir: 



mo 



SUBMISSION OF FORMAL DRAWINGS 



Subject to the approval of the Examiner, please replace the informal drawings 
with the 185 of formal drawings filed herewith (Figures 1-57B). If the formal drawings for 
any reason are not in full compliance with the pertinent statutes and regulations, please 
so advise the undersigned. 

Applicants note that in Figure 11B, bases 175-177 are improperly shown as 
"CCC," when they should be "CGC," which is the codon that codes for "arg." Applicants 
have made the change to these drawings and in the Sequence Listing. Support for the 
amendment is found in the originally filed Figure, which shows "arg." 

Applicants also note that in Figure 16B, bases 391-393 are improperly shown as 
"TCC," when they should be "TTC," which is the codon that codes for "phe." Applicants 
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have made the change to these drawings and in the Sequence Listing. Support for the 

amendment is found in the originally filed Figure, which showed "phe." 

If any fees are necessary for the submission of these formal drawings, please 

charge our Deposit Account No. 06-0916. 

Respectfully submitted, 

FINNEGAN, HENDERSON, FARABOW, 
GARRETT & DUNNER, L.L.P. 



Dated: March 26, 2003 



By: 



Myerj£-Payi 




Sean C. Myer$-Payne 
Reg. No. 42,920 
ph. 571 203 2717 
fax 202 408 4400 
sean.myers-payne@finnegan.com 
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SEQ ZD NO: 1 

k l GGATCCCAGGGAACGTGACC ATG GTC GTA GGG ATG ACT TGA C AGTTTC AAC GGGGTGC G AC C AC C GTTGC GC 72 
'1 4, (M V V G M T) * 7 

SEQ ID NO: 2 ^ - — SEQ ID NOs 3 



73 


TCAGAAGGCATACGTTGGTGGAACACGTCGGAAAGCTGGGAGGTGAATCTG ATG 


GCT 


GGC 


GAC 


CAA 


GAG 


CTG 


144 


1 


<M 


A 


G 


D 


Q 


E 


L 


7 




^ — SEQ. 


, ID 


NO: 


4 










145 


GAA CTG CGG TTC GAC GTT CCT CTT TAC ACG CTT GCC GAG GCA 


TCG 


CGG 


TAC 


CTG 


GTG 


GTT 


204 


8 


ELRFDVPLYTLAEA 


S 


R 


Y 


L 


V 


V 


27 



205 CCC CGC GCC ACC CTG GCT ACG TGG GCT GAC GGC TAC GAG CGT CGG CCG GCC AAC GCA CCG 2 64 
28 PRATLATWADGYERRPANAP 47 



265 GCG GTC CAG GGG CAA CCG ATC GCC TTT GAC GCC TAT TCG GTC GCG CAG CTT TTT GGC GAC 3 24 

48 AVQGQP IAFDAYSVAQLFGD 67 

32 5 GTC ACT GGT GCC CGC GTT GCG GGC GTC CAG CCG CAG CGA CAC CAC ATA CGG CCG GTC CGG 3 84 

68 VTGARVAGVQPQRHHI R PVR 87 

3 85 TTG CGG GGG CCG TTG GGT GGG GTT GGG TGC CTC CGT CAC CCC AGG CAG TTC GCT GGC TAT 444 

88 LRGPLGGVGCLRHPRQFAGY 107 

445 TTG TCG CAG TAG CGCGACGGCATTGTCG ATG TCT TGG TAG C T AGC ATC C GGTC GGGGGGC CGC TAC C AGC G 515 

108 L S Q) * " MS W* 4 

516 CCAGCGCCGGGGCTCCCCGGTCCGGGTAGTGCGCGTCGAGTTGGTCGTGGACCAGCA ATG ACT GCG ACC CGG 587 

1 SEQ. ID NO: 5 . (M T A T R 5 

588 CGA CTT CGA AAC CGC CAC CGG TTA GAT TCC CCG ACT GCG TCA TCG CCA GGT AAA CCG CCG 647 

6RLRNRHRLDSPTAS S PGKP P 25 

648 GCA CTA ACG CCA GCA ACC AAC CCG TGA AGACCAACCAACGGCACCTGCGCAGGTTGCGGCTCAACCGCATC 718 

26 ALT PATN P) * 34 

719 ATG AAC TGC TGG ATT TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT GCC CGC/ GAG CCT ACC 778 

CMMN-CWI SDSPYSRAVRAREPT 20 
SEQ ID NO: 6 

779 GAA GAT CGC GTG CAT GCG TTC GGC GTG GAC CGC ACA GCA CCT GGA GTT GGC GGC GCC GAG 83 8 

21 EDRVHAFGVDRTAPGVGGAE 40 

839 GGC CGA GAT GGC AGG ATG ACG GAT CGT CGG GGG CGG GAA CTC CCA GGC CGC CGG ACC GTC 898 

41 GRDGRMTD RRGRELPGRRTV 60 



899 


GCA 


AAC 


CCG 


TCG 


CAA 


ACC 


CGT 


CGC 


AAA 


CCG 


TAA 


GGAGTCATCC ATG AAG ACA GGC ACC GCG 


959 


61 


A 


N 


P 


S 


Q 


T 


R 


R 


K 


P) 


* 






J? (M K T G T A 


6 






























— SEQ. 


ID NO: 7 


t 








960 


ACG 


ACG 


CGG 


CGC 


AGG 


CTG 


TTG 


GCA 


GTA 


CTG 


ATC 


GCC 


CTC 


GCG 


TTG 


CCG 


GGG 


GCC 


GCC 


GTT 


1019 


7 


T 


T 


R 


R 


R 


L 


L 


A 


V 


L 


I 


A 


L 


A 


L 


P 


G 


A 


A 


V 


26 


1020 


GCG 


CTG 


CTG 


GCC 


GAA 


CCA 


TCA 


GCG 


ACC 


GGC 


GCG 


TCG 


GAC 


CCG 


TGC 


GCG 


GCC 


AGC 


GAA 


GTG 


1079 


27 


A 


L 


L 


A 


E 


P 


S 


A 


T 


G 


A 


S 


D 


P 


C 


A 


A 


S 


E 


V 


46 


1080 


GCG 


AGG 


ACG 


GTC 


GGT 


TCG 


GTC 


GCC 


AAG 


TCG 


ATG 


GGC 


GAC 


TAC 


CTG 


GAT 


TCA 


CAC 


CCA 


GAG 


1139 


47 


A 


R 


T 


V 


G 


S 


V 


A 


K 


S 


M 


G 


D 


Y 


L 


D 


S 


H 


P 


E 


66 


1140 


ACC 


AAC 


CAG 


GTG 


ATG 


ACC 


GCG 


GTC 


TTG 


CAG 


CAG 


CAG 


GTA 


GGG 


CCG 


GGG 


TCG 


GTC 


GCA 


TCG 


1199 


67 


T 


N 


Q 


V 


M 


T 


A ' 


V 


L 


Q 


Q 


Q 


V 


G 


P 


G 


S 


V 


A 


S 


86 


1200 


CTG 


AAG 


GCC 


CAT 


TTC 


GAG 


GCG 


AAT 


CCC 


AAG 


GTC 


GCA 


TCG 


GAT 


cc 












1243 


87 


L 


K 


A 


H 


F 


E 


A 


N 


P 


K 


V 


A 


S 


D) 














100 



SEQ ID NOS. 1-7 

FIG. 1 



RECEIVED 

MAR V- 8 7003 



J- 



of the clone containing DP42 8 and contained in seql 



y~ SEQ ZD NO: 8 














31/11 


















p a rn ppp 


CTT 


TGA 


CGC 


CTA 


TTC 


GGT 


CGC 


GCA 


GCT TTT 


TGG 


CGA 


V^V? J. 


PTtP 

^.AV— 


mpp 


J. 


ppp 




asp arg 


leu 


OPA ( arg 


leu 


phe 


gly 


arg 


ala 


ala phe 




arg 


arg 


his 


f rn 


cys 


pro 


arg 


61/21 




£SEQ 


ID NO: 9 






J. / J -L 


















TGC GGG 


CGT 


CCA 


GCC 


GCA 


p r~*r~< 


ACA 


CCA 


CAT 




ppm 


ppp 




ppp 




ppp 




ppp 
wow 


cys gly 


arg 


pro 


ala 


ala 


ala 


thr 


pro 


his 


uni ala. 


gly 


pro 


vai 


aia 


gly 


aia 


vai 


gly 


121/41 


















1 CI /CI 
131/ Dl 


















TGG GGT 


TGG 


GTfl. OOB^CCG 


TCA 


CCC 


CAG 


GCA 


prprp ppp 


rpPP 


prp j\ 

C 1 A 


TTT 


pmp 

GTC 


PP A 

GCA 


GTA 


ppp 

GCG 


CGA 


trp gly 


trp 


val pro pro 


ser 


pro 


glri 


ala 


val arg 


trp 


leu 


phe 


vai 


aia 


vai 


aia 


arg 


181/61 


















211/71 


















CGG CAT 


TGT 


CGA 


TGT 


CTT 


GGT 


AGC 


TAG 


CAT 


CCG GTC 


GGG 


GGG 


PPP 


CTA 


CCA 


GCG 


CCA 


GCG 


arg his 


cys 


arg 


cys 


leu 


gly 


ser ) AMB (his 


pro val 


gly 


gly 


pro 


1 eu 


pro 


ala 


pro 


= i = 

d J. a. 








SEC 


i id 


NO: 


10 — 




271/91 


















CCG GGG 


CTC 


CCC 


GGT 


CCG 


GGT 


AGT 


GCG 


CGT 


CGA GTT 


GGT 


CGT 






pp A 


J\ rpp 
A 1 \J 


A prp 


ppp 


pro gly 


leu 


pro 


gly 


pro 


gly 


ser 


ala 


arg 


arg val 


gly 


arg 


gly 


pro 


did 


me t 


LilX 


a. X a. 


301 / 101 


















331/111 


















ACC CGG 


CGA 


CTT 


CGA 


AAC 


CGC 


CAC 


CGG 


TTA 


GAT TCC 


CCG 


ACT 


ppp 


rpp a 
1 t-A 


mpp 


pp A 


pprp 


A 7X & 
AAA 


thx arg 


arg 


leu 


arg 


asn 


arg 


his 


arg 


leu 


asp ser 


pro 


thr 


ala 






pro 


gly 


lys 


JOl/ lzl 


















391/131 


















prip ppp 


GCA 


CTA 


ACG 


CCA 


GCA 


ACC 


AAC 


CCG 


TGA AGA 


CCA 


ACC 


AAC 


GGC 


ACC 


TGC 


GCA 


GGT 


pro pro 


ala 


leu 


thr 


pro 


ala 


thr 


asn 


pro) OPA (arg 


pro 


thr 


asn 


gly 


thr 


cys 


ala 


y i-y 


/I O 1 /1 Jl 1 
*±Z± / 141 


















451/151^-SEQ ID 


NO: 


11 










TGC GGC 


TCA 


ACC 


GCA 


TCA 


TGA 


ACT 


GCT 


GGA 


TTT CGG 


ACT 


CCC 


CGT 


ACT 


CTC 


GCG 




mpp 


cys gly 


ser 


thr 


ala 


ser) OPA (thr 


ala 


gly 


phe arg 


thr 


pro 


arg 


thr 




ala 


gin 


cys 


4ol/ lol 


SEQ XD NO: 


: 12 










511/171 


















CCC 


GCG 


AGC 


CTA 


CCG 


AAG 


ATC 


GCG 


TGC 


ATG CGT 


TCG 


GCG 


TGG 


ACC 


GCA 


CAG 


CAC 


CTG 


val pro 


ala 


ser 


leu 


pro 


lys 


ile 


ala 


cys 


met arg 


ser 


ala 


trp 


thr 


ala 


gin 


his 


leu 


541/181 


















571/191 


















GAG TTG 


GCG 


GCG 


CCG 


AGG 


GCC 


GAG 


ATG 


GCA 


GGA TGA 


CGG 


ATC 


GTC 


GGG 


GGC 


GGG 


AAC 


TCC 


glu leu 


ala 


ala 


pro 


arg 


ala 


glu 


met 


ala 


gly) OPA 


(arg 


ile 


val 


gly 


y J- Jr 


y -Y 


asn 


ser 


bUl/ zUl 


















631/211 


*-SEQ ZD NO: 13 










CAG GCC 


GCC 


GGA 


CCG 


TCG 


CAA 


ACC 


CGT 


CGC 


AAA CCC 


GTC 


GCA 


AAC 


CGT 


AAG 


GAG 


TCA 


TCC 


gin ala 


ala 


gly 


pro 


ser 


gin 


thr 


arg 


arg 


lys pro 


val 


ala 


asn 


arg 


iy s 


glu 






661/22 1 


















691/231 


















a mp aap 


ACA 


GGC 


ACC 


GCG 


ACG 


ACG 


CGG 


CGC 


AGG CTG 


TTG 


GCA 


GTA 


CTG 


ATC 


GCC 


CTC 


GCG - 


met lys 


thr 


gly 


thr 


ala 


thr 


thr 


arg 


arg 


arg leu 


leu 


ala 


val 


leu 


ile 


ala 


leu 


ala 


721/241 


















751/251 


















TTG CCG 


GGG 


GCC 


GCC 


GTT 


GCG 


CTG 


CTG 


GCC 


GAA CCA 


TCA 


GCG 


ACC 


GGC 


GCG 


TCG 


GAC 


CCG 


leu pro 


gly 


ala 


ala 


val 


ala 


leu 


leu 


ala 


glu pro 


ser 


ala 


thr 


gly 


ala 


ser 


asp 


pro 


781/261 


















811/271 


















TGC GCG 


GCC 


AGC 


GAA 


GTG 


GCG 


AGG 


ACG 


GTC 


GGT TCG 


GTC 


GCC 


AAG 


TCG 


ATG 


GGC 


GAC 


TAC 


cys ala 


ala 


ser 


glu 


val 


ala 


arg 


thr 


val 


gly ser 


val 


ala 


lys 


ser 


met 


gly 


asp 


tyr 


841/281 


















871/291 


















CTG GAT 


TCA 


CAC 


CCA 


GAG 


ACC 


AAC 


CAG 


GTG 


ATG ACC 


GCG 


GTC 


TTG 


CAG 


CAG 


CAG 


GTA 


GGG 


leu asp 


ser 


his 


pro 


glu 


thr 


asn 


gin 


val 


met thr 


ala 


val 


leu 


gin 


gin 


gin 


val 


gly 


901/301 


















931/311 


















CCG GGG 


TCG 


GTC 


GCA 


TCG 


CTG 


AAG 


GCC 


CAT 


TTC GAG 


GCG 


AAT 


CCC 


AAG 


GTC 


GCA 


TCG 


GAT 


pro gly 


ser 


val 


ala 


ser 


leu 


lys 


ala 


his 


phe glu 


ala 


asn 


pro 


lys 


val 


ala 


ser 


asp) 



SEQ XD NOS:8-13 



FIG. 1A" 



nsert of the clone containing DP428, other reading frame 



27 1 SEQ ID NO: 


14 












32/11 


















ATC GCC 


TTT 


GAC 


GCC 


TAT 


TCG 


GTC 


GCG 


CAG 


CTT TTT 


GGC 


GAC 


GTC 


ACT 


GGT 


GCC 


CGC 


GTT 


(ile ala 


phe 


asp 


ala 


tyr 


ser 


val 


ala 


gin 


leu phe 


qlv 


asp 


val 


thr 


gly 


ala 


arg 


val 


62/21^SEQ ID NO: 15 










92/31 


















GCG GGC 


GTC 


CAG 


CCG 


CAG 


CGA 


CAC 


CAC 


ATA 


CGG CCG 


GTC 


CGG 


TTG 


CGG 


GGG 


CCG 


TTG 


GGT 


ala gly 


val 


crln 


pro 


gin 


arg 


his 


hi s 


ile 


ara nro 


val 


arg 


leu 


arg 


gly 


pro 


leu 


crlv 


122/41 


















152/51 


















GGG GTT 


GGG 


TGC 


CTC 


CGT 


CAC 


CCC 


AGG 


CAG 


TTC GCT 


GGC 


TAT 


TTG 


TCG 


CAG 


TAG 


CGC 


GAC 


crlv val 


gly 


cys 


leu 


arg 


his 


pro 


arg 


gin 


phe ala 


crlv 


tyr 


leu 


ser 


gin) AMB (arg 


asp 


182/61 


















212/71 






SEQ 


ID NO: 16^> 






GGC ATT 


GTC 


GAT 


GTC 


TTG 


GTA 


GCT 


AGC 


ATC 


CGG TCG 


GGG 


GGC 


CGC 


TAC 


CAG 


CGC 


CAG 


CGC 


crlv ile 


val 


asp 


val 


leu 


val 


ala 


ser 


ile 


arg ser 


crlv 


gly 


arg 


tyr 


gin 


arg 


gin 


arcr 


242/81 


















272/91 


















CGG GGC 


TCC 


CCG 


GTC 


CGG 


GTA 


GTG 


CGC 


GTC 


GAG TTG 


GTC 


GTG 


GAC 


CAG 


CAA 


TGA 


CTG 


CGA 


arg gly 


ser 


pro 


val 


arg 


val 


val 


arg 


val 


glu leu 


val 


val 


asp 


gin 


gln)OPA( leu 


arg 


302/101 


















332/111 




SEQ ID 


NO: 


17— 






CCC GGC 


GAC 


TTC 


GAA 


ACC 


GCC 


ACC 


GGT 


TAG 


ATT CCC 


CGA 


CTG 


CGT 


CAT 


CGC 


CAG 


GTA 


AAC 


pro gly 


asp 


phe 


glu 


thr 


ala 


thr 


gly)AMB(ile pro 


arg 


leu 


arg 


his 


arg 


gin 


val 


asn 


362/121 






SEQ 


ID NO: 18- 




J* 


'392/131 


















CGC CGG 


CAC 


TAA 


CGC 


CAG 


CAA 


CCA 


ACC 


CGT 


GAA GAC 


CAA 


CCA 


ACG 


GCA 


CCT 


GCG 


CAG 


GTT 


arg arg 


his)OCH(arg 


gin 


gin 


pro 


thr 


arg 


glu asp 


gin 


pro 


thr 


ala 


pro 


ala 


gin 


val 


422/141 






^ — SEQ ID NO: 19 




452/151 


















GCG GCT 


CAA 


CCG 


CAT 


CAT 


GAA 


CTG 


CTG 


GAT 


TTC GGA 


CTC 


CCC 


GTA 


CTC 


TCG 


CGC 


AGT 


GCG 


ala ala 


gin 


pro 


his 


his 


glu 


leu 


leu 


asp 


phe gly 


leu 


pro 


val 


leu 


ser 


arg 


ser 


ala 


482/161 


















512/171 


















TGC CCG 


CGA 


GCC 


TAC 


CGA 


AGA 


TCG 


CGT 


GCA 


TGC GTT 


CGG 


CGT 


GGA 


CCG 


CAC 


AGC 


ACC 


TGG 


cvs t>ro 


arg 


ala 


tyr 


arg 


arg 


ser 


arg 


ala 


cys val 


arg 


arg 


gly 


pro 


his 


ser 


thr 


trp 


542/181 


















572/191 


















AGT TGG 


CGG 


CGC 


CGA 


GGG 


CCG 


AGA 


TGG 


CAG 


GAT GAC 


GGA 


TCG 


TCG 


GGG 


GCG 


GGA 


ACT 


CCC 


ser trp 


arg 


arg 


arg 


gly 


pro 


arg 


trp 


gin 


asp asp 


gly 


ser 


ser 


gly 


ala 


gly 


thr 


pro) 


602/201 


















632/211 


















AGG CCG 


CCG 


GAC 


CGT 


CGC 


AAA 


CCC 


GTC 


GCA 


AAC CCG 


TCG 


CAA 


ACC 


GTA 


AGG 


AGT 


CAT 


CCA 


arg pro 


pro 


asp 


arg 


arg 


lys 


pro 


val 


ala 


asn pro 


ser 


gin 


thr 


val 


arg 


ser 


his 


pro 


662/221 


















692/231 


















TGA AGA 


CAG 


GCA 


CCG 


CGA 


CGA 


CGC 


GGC 


GCA 


GGC TGT 


TGG 


CAG 


TAC 


TGA 


TCG 


CCC 


TCG 


CGT 


OPA(argl 


l gin 


ala 


pro 


arg 


arg 


arg 


gly 


ala 


gly cys 


trp 


gin 


tyr)OPA(ser 


pro 


ser 


arg 


722/241 




SEQ 


ID NO: 20 








752/251 


SEQ ID 


NO: 


21 — 










TGC CGG 


GGG 


CCG 


CCG 


TTG 


CGC 


TGC 


TGG 


CCG 


AAC CAT 


CAG 


CGA 


CCG 


GCG 


CGT 


CGG 


ACC 


CGT 


cys arg 


gly 


pro 


pro 


leu 


arg 


cys 


trp 


pro 


asn his 


gin 


arg 


pro 


ala 


arg 


arg 


thr 


arg 


782/261 


















812/271 


















GCG CGG 


CCA 


GCG 


AAG 


TGG 


CGA 


GGA 


CGG 


TCG 


GTT CGG 


TCG 


CCA 


AGT 


CGA 


TGG 


GCG 


ACT 


ACC 


ala arg 


pro 


ala 


lys 


trp 


arg 


gly 


arg 


ser 


val arg 


ser 


pro 


ser 


arg 


trp 


ala 


thr 


thr 


842/281 


















872/291 


















TGG ATT 


CAC 


ACC 


CAG 


AGA 


CCA 


ACC 


AGG 


TGA 


TGA CCG 


CGG 


TCT 


TGC 


AGC 


AGC 


AGG 


TAG 


GGC 


trp ile 


his 


thr 


gin 


arg 


pro 


thr 


arg)OPA 


OPAJpro 


arg 


ser 


cys 


ser 


ser 


arg) AMB (gly 


902/301 








SEQ : 


ZD NO: 22 


"932/311 






SEQ : 


ID NO: 23 




CGG GGT 


CGG 


TCG 


CAT 


CGC 


TGA 


AGG 


CCC 


ATT 


TCG AGG 


CGA 


ATC 


CCA 


AGG 


TCG 


CAT 


CGG 


ATC 


arg gly 


arg 


ser 


his 


arg)OPA(arg 


pro 


ile 


ser arg 


arg 


ile 


pro 


arg 


ser 


his 


arg 


ile) 








CSEQ 


ID NO: ! 


24 



















SEQ ID NOS: 14-24 

FIG. 1B' 



SeqlC : Insert of the DP428 clone, other reading frame 



3/l^SEQ ID 


NO: 


25 












33/11 


















TCG CCT 


TTG 


ACG 


CCT 


ATT 


CGG 


TCG 


CGC 


AGC 


TTT TTG 


GCG 


ACG 


TCA 


CTG 


GTG 


CCC 


GCG 


TTG 


(ser pro 


leu 


thr 


pro 


ile 


arg 


ser 


arg 


ser 


phe leu 


ala 


thr 


ser 


leu 


val 


pro 


ala 


leu 


63/21^SEQ ID NO: 26 










93/31 


















CGG GCG 


TCC 


AGC 


CGC 


AGC 


GAC 


ACC 


ACA 


TAC 


GGC CGG 


TCC 


GGT 


TGC 


GGG 


GGC 




TGG 


GTG 


arg ala 


ser 


ser 


arg 


ser 


asp 


thr 


thr 


tvr 


y J- y ci-l y 


ser - 


• y J- jr 


cys 


y i y 


y -Ly 


oi y 


trp 


val 


123/41 


















153/51 


















GGG TTG 


GGT 


GCC 


TCC 


GTC 


ACC 


CCA 


GGC 


AGT 


TCG CTG 


GCT 


ATT 


^TGT 


CGC 


AGT 


AGC 


GCG 


ACG 


gly leu 


gly 


ala 


ser 


val 


thr 


pro 


qlv 


ser 


ser leu 


ala 


ile 


cys 


arg 


ser 


ser 


ala 


thr 


183/61 


















213/71 


















GCA TTG 


TCG 


ATG 


TCT 


TGG 


TAG 


CTA 


GCA 


TCC 


GGT CGG 


GGG 


GCC 


GCT 


ACC 


AGC 


GCC 


AGC 


GCC 


ala leu 


ser 


met 


ser 


trp)AMB(leu 


ala 


ser 


gly arg 


crlv 

=3 Jr 


ala 


ala 


thr 


ser x 


-ala 


ser 


ala 


243/81 


SEQ ID NO: 27 9 








273/91 


















GGG GCT 


ccc 


CGG 


TCC 


GGG 


TAG 


TGC 


GCG 


TCG 


AGT TGG 


TCG 


TGG 


ACC 


AGC 


AAT 


GAC 


TGC GAC 


gly ala 


pro 


arg 


ser 


gly ) AMB ( cys 


ala 


ser 


ser trp 


ser 


trp 


thr 


ser 


asn 


asp 


cys v 


asp. 


303/101 






SEQ ID NO: 


28- 


J 




333/111 
















CCG GCG 


ACT 


TCG 


AAA 


CCG 


CCA 


CCG 


GTT 


AGA 


TTC CCC 


GAC 


TGC 


GTC 


ATC 


GCC 


AGG 


TAA 


ACC 


pro ala 


thr 


ser 


lys 


pro 


pro 


oro 


val 


arg 


phe pro 


aso 


cys 


val 


ile 


ala 


arg)OCH(thr 


363/121 


















393/131 




SEQ ID NO: 29- 






GCC GGC 


ACT 


AAC 


GCC 


AGC 


AAC 


CAA 


CCC 


GTG 


AAG ACC 


AAC 


CAA 


CGG 


CAC 


CTG 


CGC 


AGG 


TTG 


ala gly 


thr 


asn 


ala 


ser 


asn 


crln 


oro 


val 


lys thr 


asn 


gin 


arg 


his 


leu 


arg 


arg 


leu 


423/141 


















453/151 


















CGG CTC 


AAC 


CGC 


ATC 


ATG 


AAC 


TGC 


TGG 


ATT 


TCG GAC 


TCC 


CCG 


TAC 


TCT 


CGC 


GCA 


GTG 


CGT 


arg leu 


asn 


arg 


ile 


met 


asn 


cys 


trp 


ile 


ser asp 


ser 


pro 


tyr 


ser 


arg 


ala 


val 


arg 


483/161 


















513/171 


















GCC CGC 


GAG 


CCT 


ACC 


GAA 


GAT 


CGC 


GTG 


CAT 


GCG TTC 


GGC 


GTG 


GAC 


CGC 


ACA 


GCA 


CCT 


GGA 


ala arg 


glu 


pro 


thr 


glu 


asp 


arg 


val 


his 


ala phe 


crlv 


val 


asp 


arg 


thr 


ala 


pro 


gly 


543/181 


















573/191 


















GTT GGC 


GGC 


GCC 


GAG 


GGC 


CGA 


GAT 


GGC 


AGG 


ATG ACG 


GAT 


CGT 


CGG 


GGG 


CGG 


GAA 


CTC 


CCA 


val gly 


gly 


ala 


glu 


gly 


arg 


asp 


crlv 


arcr 


met thr 


asp 


arg 


arg 


gly 


arg 


glu 


leu 


pro 


603/201 


















633/211 


















GGC CGC 


CGG 


ACC 


GTC 


GCA 


AAC 


CCG 


TCG 


CAA 


ACC CGT 


CGC 


AAA 


CCG 


TAA 


GGA 


GTC 


ATC 


CAT 


gly arg 


arg 


thr 


val 


ala 


asn 


pro 


ser 


gin 


thr arg 


arg 


lys 


pro)OCH(gly^yal 


ile 


his 


663/221 


















693/231 




SEQ ID NO: 








GAA GAC 


AGG 


CAC 


CGC 


GAC 


GAC 


GCG 


GCG 


CAG 


GCT GTT- 


GGC 


AGT 


ACT 


GAT 


CGC 


CCT 


CGC 


GTT 


glu asp 


arg 


his 


arg 


asp 


asp 


ala 


ala 


gin 


ala val 


crlv 


ser 


thr 


asp 


arg 


pro 


arg 


val 


723/241 


















753/251 


















GCC GGG 


GGC 


CGC 


CGT 


TGC 


GCT 


GCT 


GGC 


CGA 


ACC ATC 


AGC 


GAC 


CGG 


CGC 


GTC 


GGA 


CCC 


GTG 


ala gly 


gly 


arg 


arg 


cys 


ala 


ala 


gly 


arg 


thr ile 


ser 


asp 


arg 


arg 


val 


gly 


pro 


val 


783/261 


















813/271 


















CGC GGC 


CAG 


CGA 


AGT 


GGC 


GAG 


GAC 


GGT 


CGG 


TTC GGT 


CGC 


CAA 


GTC 


GAT 


GGG 


CGA 


CTA 


CCT 


arg gly 


gin 


arg 


ser 


gly 


glu 


asp 


gly 


arg 


phe gly 


arg 


gin 


val 


asp 


gly 


arg 


leu 


pro 


843/281 


















873/291 


















GGA TTC 


ACA 


CCC 


AGA 


GAC 


CAA 


CCA 


GGT 


GAT 


GAC CGC 


GGT 


CTT 


GCA 


GCA 


GCA 


GGT 


AGG 


GCC 


gly phe 


thr 


pro 


arg 


asp 


gin 


pro 


gly 


asp 


asp arg 


gly 


leu 


ala 


ala 


ala 


gly 


arg 


ala 


903/301 


















933/311 


















GGG GTC 


GGT 


CGC 


ATC 


GCT 


GAA 


GGC 


CCA 


TTT 


CGA GGC 


GAA 


TCC 


CAA 


GGT 


CGC" 


ATC 


GGA 


TC 


gly val 


gly 


arg 


ile 


ala 


glu 


gly 


pro 




arg gly 


glu 


ser 


gin 


gly 


arg 


ile 


gly) 





SEQ XD NOS: 25-30 



FIG. 1C 



ng sequence DP428 identical to the Rv0203 predicted by Cole et al . 
(Nature 393:537-544) 31/11 

1/1^ SEQ ID HO! 31 GTA CTG ATC gcc CTC GCG 

Sf£ S S S S S S S S » 3 - - - - - - l » - i - 

SH5 "S 2 S 2 S S S 2 £ S 2 S £ = = 2 £ 

22 s 2 2 s ss s s £ = 2 s s 2 = £ s 2 

181/61 „ »or lt r nr TTG ATG ACC GCG GTC TTG CAG CAG CAG GTA GGG 

SS Xp 2 2 .2 2 2 2JL v.. X« .!» .In 1» «! .1, 

241/81 nrrr aap CCC CAT TTC GAG GCG AAT CCC AAG GTC GCA TCG GAT 

2 2 2 2 2 2 2 "2 2 2 |e gu =1= - Pro ly- val - -~ « 

CTg'cAC GCG CTT TCG CAA CCG CTG ACC GAT CTT TCG ACT CGG TGC TCG CTG CCG ATC AGC 

leu his ala leu ser gin pro leu thr asp Ij^J" ttir arg 

ss 2 s s 2 s = 2 = - - - = ~ ss-s. 

SEQ ID NOS: 31-32 

FIG. 1D' 

ORF containing the DP428 sequence and forming part of seqlA' 

1/1 ^SEQ ID NO: 33 Q TCG CAA ACC C GT CGC AAA 

TGA^CGG ATC GTC GGG GGC GGG AAC TCC CAG GCC GCC GGA CCG ^ ^ 

OPA(arg ile val gly gly ffly «™ ser gin ala ala gly P r ° i/31 9 

« SsHb sssi 2 = s 22 2 2 s s s 2 s 2 
si- s 2 s ss s s 2 2 s s s s s 2 2 S Si 

2 2 S 2 2 S 2 2 2 SS SS 2 2 2 S 2 S 55 2 



s s 2 s £ s ss 2 22 s 2 2 s 2 ss 2 2 

SSS 2 5= 2 - 3 S S £2 2 S = = - S S 2- 

361/121 r rp r rar gcg CTT TCG CAA CCG CTG ACC GAT CTT 

2 s 2 2 s 2 s 2 2 s 22 - ~ °- - - - *^ 

421/141 TG CAG GCG atC GGT TTG 

TCG ACT CGG TGC TCG CTG CCG ATC AGC GGC CTG CAO 

ser thr arg cys ser leu pro ile ser gly leu gin ala gly J 

481/161 

CAG GGC GCC CGC CGG TAG 

gin gly ala arg arg)AMB * — 

SEQ ID NOt 

FIG. 1 




GCGGTCGGGGGGCCGCTACCAGCGCCAGCGCCGGGGCTCCCCGGTCCGGGTA GTG CGC GTC GAG TTG GTC GTG 563 
k <i SEQ ID NO: 35 SEQ ID NO: 37 — ^ <V R V E L V V 7 

GAC CAG CAA TGA CTGCGACCCGGCGACTTCGAAACCGCCACCGGTTAGATTCCCCGACTGCGTCATCGCCAGGTAA 639 
D Q Q) SEQ ID NO: 36 



640 ACCGCCGGCACTAACGCCAGCAACCAACCC GTG AAG ACC AAC CAA CGG CAC CTG CGC AGG TTG CGG 
1 SEQ ID NO: 38 (VKTNQRHLRRLR 



705 
12 



706 CTC AAC CGC ATC ATG AAC TGC TGG ATT TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT GCC 
13LNRIMNCWI SDSPYSRAVRA 



765 
32 



766 CGC GAG CCT ACC GAA GAT CGC GTG CAT GCG 
33 REPTEDRVHA 

826. GGC GGC GCC GAG GGC CGA GAT GGC AGG ATG 
53GGAEGRDGRM 

886 CGC CGG ACC GTC GCA AAC CCG TCG CAA ACC 
73RRT\VANPSQT 

947 ACA GGC ACC GCG ACG ACG CGG CGC AGG CTG 
3TGTATTRRRL 

1007 GGG GCC GCC GTT GCG CTG CTG GCC GAA CCA 
23 GAAVALLAEP 

1067 GCC AGC GAA GTG GCG AGG ACG GTC GGT TCG 
43 ASEVARTVGS 

1127 TCA CAC CCA GAG ACC AAC CAG GTG ATG ACC 
63SHPETNQVMT 

1187 TCG GTC GCA TCG CTG AAG GCC CAT TTC GAG 
83 SVASLKAHFE 

1247 GCG CTT TCG CAA CCG CTG ACC GAT CTT TCG 
103 ALSQPLTDLS 

13 07 CAG GCG ATC GGT TTG ATG CAG GCG GTG CAG 
123 QAIGLMQAVQ 

13 67 CGG GTC CGG CGC AGT CGA CGT GAG GCA GCG 

6RVRRSRREAA 

1427 TCT GGT CGC AGG TCA GGG GTC GGC GCT GGA 
26 SGRRSGVGAG 

14 87 CAG GGT GTG CCC TGC GGT TGG ATG ACA AGT 
46 QGVPCGWMTS 

1547 TCG TTG T 
66S It) 



TTC GGC GTG GAC CGC ACA GCA CCT GGA GTT 82 5 

FGVDRTAPGV 52 

ACG GAT CGT CGG GGG CGG GAA CTC CCA GGC 885 

TDRRGRELPG 72 

CGT CGC AAA CCG TAA GGAGTCATCC ATG AAG 94 6 

R R K P) * xxxxxx (M K 2 

SEQ ID NO: 39 

TTG GCA GTA CTG ATC GCC CTC GCG TTG CCG 1006 

LAVLIALALP 22. 

TCA GCG ACC GGC GCG TCG GAC CCG TGC GCG 1066 

SATGASDPCA 42 

GTC GCC AAG TCG ATG GGC GAC TAC CTG GAT 112 6 

VAKSMGDYLD 62 

GCG GTC TTG CAG CAG CAG GTA GGG CCG GGG 1186 

AVLQQQVGPG 82 

GCG AAT CCC AAG GTC GCA TCG GAT CTG CAC 124 6 

ANPKVASDLH 102 

ACT CGG TGC TCG CTG CCG ATC AGC GGC CTG 13 06 

TRCSLPISGL 122 

GGC GCC CGC CGG TAG ATG CCG GAC CGC CGC 13 66 

GARR)*^(MPDRR 5 

SEQ ID NO: 40 

GTC GCC TAC CGG GGC GGT GTC TCG CCG CCT 14 26 

VAYRGGVSPP 25 

CCT TGC GGT GTG GTT TCG ACC GGG TCG TCG 1486 

PCGVVSTGSS 45 

CGC AGG TTT GGA TCG GTT GGC GGG TCG CGA 154 6 

RRFGSVGGSR 65 

1553 
67 



SEQ XD NOS: 35-40 



FIG. 2 



/SEQ XD NO: 41 

TCG CCG GCT CGC GGA CGT AGA TAA TAG CTC 
( ser p ro ala arg gly arg arg)OCH AMBJleu 
61/2 1**""" SEQ ID NO: 42 SEQ ID NO: ^43 
TGA CTG CCG GGC, TTG ACG CGG ACG ACC ACA 
OPA(leu pro gly leu thr arg thr thr thr 
121/TT^SEQ ID NO: 44 

TGA CCT GGG GTG CGT GGG CGC CGA CGA GTG 
OPA(pro gly val arg gly arg arg arg val 
181/61 SEQ ID NO: 45 

CTC GGT CGC CGG CAG TGT CAG CAT GTG CAG 
leu qly arq arq qln cys gin his val gin 
241/81 

TCG TGG TTG CGA CGA CTT GGC GCT GGT GAG 

ser trp leu arg arg leu gly ala gly glu 

301/101 

GAT C 

asp) 



31/11 

ACC GTT GGA CGA CCT CGA CAG GGT CCT TTG 

thr val gly arg pro arg gin gly pro leu) 
91/31 

GAG TCG GGT CAT CGC CTA AGG CTA CCG TTC 

glu ser gly his arg leu arg leu pro phe) 
151/51 

AGG CAG TCA TGT CTC AGG GCC CAC CGC CAC 

arg gin ser cys leu arg ala his arg his 
211/71 

ATG ACT CCA CGC AGC TTG TTC GTG TTG GTG 

met thr pro arg ser leu phe val leu val 
271/91 

CGC ACC CGC CGG CGT CGT GCC GCG CAT GCG 

arg thr arg arg arg arg ala ala his ala 



SEQ XD NOS: 41-45 

FIG. 3 A 



^■SEQ ID NO: 46 32/11 

CGC CGG CTC GCG GAC GTA GAT AAT AGC TCA CCG TTG GAC GAC CTC GAC AGG GTC CTT TGT 
( arg arg leu ala asp val asp asn ser ser pro leu asp asp leu asp arg val leu cys 
62/21 "SEQ ID NO: 47 92/31 

GAC TGC CGG GCT TGA CGC GGA CGA CCA CAG AGT CGG GTC ATC GCC TAA GGC TAC CGT TCT 
asp cys arg ala)OPA(arg gly arg pro gin ser arg val ile ala)OCH(gly tyr arg ser 
122/41 V SEQ ID NO: 48 152/51 ^SEQ ID NO: 49 

GAC CTG GGG TGC GTG GGC GCC GAC GAG TGA GGC AGT CAT GTC TCA GGG CCC ACC GCC ACC 
asp leu gly cys val gly ala asp glu)OPA(gly ser his val ser gly pro thr ala thr 
182/61 SEQ ID NO: 50 ^212/71 

TCG GTC GCC GGC AGT GTC AGC ATG TGC AGA TGA CTC CAC GCA GCT TGT TCG TGT TGG TGT 
ser val ala gly ser val ser met cys arg)OPA(leu his ala ala cys ser cys trp cys 
242/81 272/91"- SEQ ID NO: 51 

CGT GGT TGC GAC GAC TTG GCG CTG GTG AGC GCA CCC GCC GGC GTC GTG CCG CGC ATG CGG 

arg gly cys asp asp leu ala leu val ser ala pro ala gly val val pro arg met arg 

302/101 

ATC 

ile) 



SEQ ID NOS: 46-51 



FIG. 3B 



ZD NO: 52 

GCC GGC TCG CGG ACG TAG ATA 
( ala g ly ser arg thr)AMB(ile 
63/21^ SEQ ID NO: 53 <^SEQ 

ACT GCC GGG CTT GAC GCG GAC 
thr ala gly leu asp ala asp 
123/41 

ACC TGG GGT GCG TGG GCG CCG 
thr trp gly ala trp ala pro 
183/61 

CGG TCG CCG GCA GTG TCA GCA 
arg ser pro ala val ser ala 
243/81 

GTG GTT GCG ACG ACT TGG CGC 
val val ala thr thr trp arg 

TC 



33/11 

ATA GCT CAC CGT TGG ACG 
ile ala his arg trp thr 
ID NO: 54 93/31 
GAC CAC AGA GTC GGG TCA 
asp his arg val gly ser 

153/51 
ACG AGT GAG GCA GTC ATG 
thr ser glu ala val met 

213/71 
TGT GCA GAT GAC TCC ACG 
cys ala asp asp ser thr 

273/91 
TGG TGA GCG CAC CCG CCG 
trp)OPA(ala his pro pro 
SEQ ID NO: 55 



ACC TCG ACA GGG TCC TTT GTG 
thr ser thr gly ser phe val 

TCG CCT AAG GCT ACC GTT CTG 
ser pro lys ala thr val leu 

TCT CAG GGC CCA CCG CCA CCT 
ser gin gly pro pro pro pro 

CAG CTT GTT CGT GTT GGT GTC 
gin leu val arg val gly val 

GCG TCG TGC CGC GCA TGC GGA 
ala ser cys arg ala cys gly) 



SEQ ID NOS: 52-55 

FIG. 3C 



SEQ ID NO: 56 



CCA ATT 


TTC 


CTT 


CGC 


GCC 


GTG 


CAA 


TAC 


CAT 


(pro ile 


phe 


leu 


arg 


ala 


val 


gin 


tyr 


his 


61/2 SEQ ID NO: 57 










GTC GCG 


CAG 


CTT 


GCG 


GAA 


ACC 


GGG 


TAT 


GGA 


val ala 


gin 


leu 


ala 


glu 


thr 


gly 


tyr 


gly 


121/41 


















CGT CGC 


TCT 


CCA 


CCC 


GTC 


GGG 


GGG 


CGA 


AAG 


arg arg 


ser 


pro 


pro 


val 


gly 


gly 


arg 


lys 


181/61 


















GGC TGA 


GTG 


TCT 


GCA 


GGG 


CTC 


CGG 


GGA 


GCA 


gly)OPA(val 


ser 


ala 


gly 


leu 


arg 


gly 


ala 


241/81 


^-SEQ ID NO: 59 








GTC CCC 


CGC 


GCG 


CGA 


CTT 


CCA 


GAC 


ATT 


TGT 


(val pro 


arg 


ala 


arg 


leu 


pro 


asp 


ile 


cys 


301/101 


- SEQ ID NO: 


60 








CAT. TTC 


GCA 


GCA 


ACC 


GGT 


CTC 


CGG 


GTC 


GCA 


his phe 


ala 


ala 


thr 


gly 


leu 


arg 


val 


ala 


361/121 


















CGG ACG 


AGT 


CGT 


CGT 


CAA 


CGA 


CCA 


CGA 


TC 


arg thr 


ser 


arg 


arg 


gin 


arg 


pro 


arg) 





31/11 


















CTG CAA 


GAC 


CAG 


CGA 


CGG 


CCC 


GTG 


GTT 


GCG 


leu gin 


asp 


gin 


arg 


arg 


pro 


val 


val 


ala 


91/31 


















CCC TGC 


CGT 


ACC 


GTT 


GTT 


GCC 


ACT 


TGA 


TGT 


pro cys 


arg 


thr 


val 


val 


ala 


thr )OPA(cys 


151/51 






SEQ ID NO: 


58- > 




CCA TTC 


CGA 


CAC 


TGG 


GAT 


CCT 


CAA 


AAC 


GTC 


pro phe 


arg 


his 


trp 


asp 


pro 


gin 


asn 


val 


211/71 


















GCC GAT 


CAT 


CAC 


CAT 


GTA 


CGA 


ACT 


GAA 


TAA 


ala asp 


his 


his 


his 


val 


arg 


thr 


glu)OCH 


271/91 


















TGT GGT 


TTC 


GGT 


TGA 


GGC 


CGA 


GGC 


GAG 


GCT 


cys gly 


phe 


gly)OPA(gly 


arg 


gly 


glu 


ala 


331/111 








C-SEQ ID NO: 


61 


GCA TCG 


TTG 


CGG 


CGA 


TCG 


CGG 


CGC 


AGT 


CGT 


ala ser 


leu 


arg 


arg 


ser 


arg 


arg 


ser 


arg 



SEQ ID NOS: 56-61 



FIG. 4A 



SEQ ID NO: 62 32/11 



CAA TTT 


TCC 


TTC 


GCG 


CCG 


TGC 


AAT 


ACC 


ATC 


TGC AAG 


ACC 


AGC 


GAC 


GGC 


CCG 


TGG 


TTG 


CGG 


gln(phe 


ser 


phe 


ala 


pro 


cys 


asn 


thr 


ile 


cys lys 


thr 


ser 


asp 


gly 


pro 


trp 


leu 


arg 


62/21*— 


— SE< 


J ID 


NO: 


63 








92/31 


















TCG CGC 


AGC 


TTG 


CGG 


AAA 


CCG 


GGT 


ATG 


GAC 


CCT GCC 


GTA 


CCG 


TTG 


TTG 


CCA 


CTT 


GAT 


GTC 


ser arg 


ser 


leu 


arg 


lys 


pro 


gly 


met 


asD 


pro ala 


val 


pro 


leu 


leu 


pro 


leu 


asp 


val 


122/41 


















152/51 


















GTC GCT 


CTC 


CAC 


CCG 


TCG 


GGG 


GGC 


GAA 


AGC 


CAT TCC 


GAC 


ACT 


GGG 


ATC 


CTC 


AAA 


ACG 


TCG 


val ala 


leu 


his 


pro 


ser 


crlv 


crlv 

33 -*- Jr 


glu 


ser 


his ser 


asp 


thr 


y J- y 


ile 


1 eu 




thr 


ser 


182/61 


















212/71 


















GCT GAG 


TGT 


CTG 


CAG 


GGC 


TCC 


GGG 


GAG 


CAG 


CCG ATC 


ATC 


ACC 


ATG 


TAC 


GAA 


CTG 


AAT 


AAG 


ala glu 


cys 


leu 


gin 


gly 


ser 


gly 


glu 


gin 


pro ile 


ile 


thr 


met 


tyr 


glu 


leu 


asn 


lys 


242/81 


















272/91 


















TCC CCC 


GCG 


CGC 


GAC 


TTC 


CAG 


ACA 


TTT 


GTT 


GTG GTT 


TCG 


GTT 


GAG 


GCC 


GAG 


GCG 


AGG 


CTC 


ser pro 


ala 


arg 


asp 


phe 


gin 


thr 


phe 


val 


val val 


ser 


val 


glu 


ala 


glu 


ala 


arg 


leu 


302/101 


















332/111 


















ATT TCG 


CAG 


CAA 


CCG 


GTC 


TCC 


GGG 


TCG 


CAG 


CAT CGT 


TGC 


GGC 


GAT 


CGC 


GGC 


GCA 


GTC 


GTC 


ile ser 


gin 


gin 


pro 


val 


ser 


gly 


ser 


gin 


his arg 


cys 


gly 


asp 


arg 


gly 


ala 


val 


val 


362/121 




































GGA CGA 


GTC 


GTC 


GTC 


AAC 


GAC 


CAC 


GAT 


C 




















gly arg 


val 


val 


val 


asn 


asp 


his 


asp) 























SEQ ZD NOS: 62-63 

FIG. 4B 



SEQ ID NO: 64 33/11 

AAT TTT CCT TCG CGC CGT GCA ATA CCA TCT GCA AGA CCA GCG ACG GCC CGT GGT TGC GGT 
( asn p he pro ser arg arg ala ile pro ser ala arg pro ala thr ala arg gly cys gly 
63/ 21"-^- SEQ ID NO: 65 93/31 

CGC GCA GCT TGC GGA AAC CGG GTA TGG ACC CTG CCG TAC CGT TGT TGC CAC TTG ATG TCG 

arg ala ala cys gly asn arg val trp thr leu pro tyr arg cys cys his leu met ser 
123/41 153/51 

TCG CTC TCC ACC CGT CGG GGG GCG AAA GCC ATT CCG ACA CTG GGA TCC TCA AAA CGT CGG 

ser leu ser thr arg arg gly ala lys ala ile pro thr leu gly ser ser lys arg arg 
183/61 213/71 

CTG AGT GTC TGC AGG GCT CCG GGG AGC AGC CGA TCA TCA CCA TGT ACG AAC TGA ATA AGT 

leu ser val cys arg ala pro gly ser ser arg ser ser pro cys thr asn)OPA(ile ser 

243/81 273/91 SEQ ID NO: 66 0 

CCC CCG CGC GCG ACT TCC AGA CAT TTG TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC TCA 

pro pro arg ala thr ser arg his leu leu trp phe arg leu arg pro arg arg gly ser 
303/101 333/111 

TTT CGC AGC AAC CGG TCT CCG GGT CGC AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG TCG 

phe arg ser asn arg ser pro gly arg ser ile val ala ala ile ala ala gin ser ser 
363/121 

GAC GAG TCG TCG TCA ACG ACC ACG ATC 
asp glu ser ser ser thr thr thr ile) 



SEQ XD NOS : 64-66 



FIG.4C 



6 WEO 



part of the nucleotide sequence of seq4A 



1^1 ^SEQ ID NO: 67 31/11 

CCG CGC GCG ACT TCC AGA CAT TTG TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC TCA TTT 
(p ro ar g ala thr ser arg his leu leu trp phe arg leu arg pro arg arg gly ser phe 
61/21 SEQ ID NOs 68 91/31 

CGC AGC AAG CGG TCT CCG GGT CGC AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG TCG GAC 
arg ser lys arg ser pro gly arg ser ile val ala ala ile ala ala gin ser ser asp 
121/41 

GAG TCG TCG TCA ACG ACC ACG ATC 
glu ser ser ser thr thr thr ile) 



SEQ ID NOS: 67-68 

FIG.4A' 



1/1 ^-SEQ ID NOs 69 31/11 

CGC GCG CGA CTT CCA GAC ATT TGT TGT GGT TTC GGT TGA GGC CGA GGC GAG GCT CAT TTC 
(a rg ala arg leu pro asp ile cys cys gly phe gly)OPA(gly arg gly glu ala his phe 
61/21 -SEQ ID NO: 70 91/31 ^SEQ ID NO: 71 

GCA GCA AGC GGT CTC CGG GTC GCA GCA TCG TTG CGG CGA TCG CGG CGC AGT CGT CGG ACG 
ala ala ser gly leu arg val ala ala ser leu arg arg ser arg arg ser arg arg thr 
121/41 

AGT CGT CGT CAA CGA CCA CGA TC 
ser arg arg gin arg pro arg) 



SEQ ID NOS: 69-71 



FIG.4B' 



1/1 ^xSEQ ID NO: 72 

GCC GCG CGC GAC TTC CAG ACA 
( ala al a arg asp phe gin thr 
6l./21^SEQ ID NO: 73 
TCG CAG CAA GCG GTC TCC GGG 
ser gin gin ala val ser gly 
121/41 

CGA GTC GTC GTC AAC GAC CAC 
arg val val val asn asp his 



r 

31/11 

TTT GTT GTG GTT TCG GTT 
phe val val val ser val 
91/31 

TCG CAG CAT CGT TGC GGC 
ser gin his arg cys gly 

GAT C 
asp 



GAG GCC GAG GCG AGG CTC ATT 
glu ala glu ala arg leu ile 

GAT CGC GGC GCA GTC GTC GGA 
asp arg gly ala val val gly 



SEQ ID NOS: 72-73 



FIG. 4C" 




ORF according to Cole et al . (Nature 393:537-544) and containing the 
sequence Seq 4A' 



1/1 ^-SEQ ID NO 


: 74 










31/11 


















tga ata 


agt 


ccg 


ccg 


cgc 


gcg 


act 


tec 


aga 


cat ttg 


ttg 


tcrcf 


ttt 


ccrcr 


tta 




ccg 


acrcr 


OPA(ile 


ser 


pro 


pro 


arg 


ala 


thr 


ser 


arg 


his leu 


leu 


trp 


phe 


arg 


leu 


arcr 


nro 


arg 


61/21 SEQ ID NO 


: 75 










91/31 


















cga ggc 


tea 


ttt 


cgc 


age 


aag 


egg 


tct 


ccg 


ggt cgc 


age 


ate 


gtt 


gcg 


gcg 


ate 


gcg 


gcg 


arg gly 


ser 


phe 


arg 


ser 


lys 


arg 


ser 


pro 


gly arg 


ser 


ile 


val 


ala 


ala 


ile 


ala 


ala 


121/41 


















151/51 


















cag teg 


teg 


gac 


gag 


teg 


teg 


tea 


acg 


ace 


acg ate 


teg 


aac 


teg 


acg 


ccc 


tec 


tgt 


teg 


gin ser 


ser 


asp 


glu 


ser 


ser 


ser 


thr 


thr 


thr ile 


ser 


asn 


ser 


thr 


pro 


ser 


cys 


ser 


181/61 


















211/71 


















agg atg 


eta 


cgc 


aga 


cag 


cgc 


teg 


atg 


gtg 


gcg ccg 


ttg 


ttg 


tac 


ate 


ggg 


atg 


cac 


acc 


arg met 


leu 


arg 


arg 


gin 


arg 


ser 


met 


val 


ala pro 


leu 


leu 


tyr 


ile 


gly 


met 


his 


thr 


241/81 


















271/91 


















gag ata 


age 


ggt 


ttc 


gee 


ggg 


ttc 


ace 


gat 


ace acg 


ctt 


gat 


gca 


tea 


cca 


ggc 


ace 


aca 


glu ile 


ser 


gly 


phe 


ala 


gly 


phe 


thr 


asp 


thr thr 


leu 


asp 


ala 


ser 


pro 


gly 


thr 


thr 



301/101 
tgg cga etc aga gac tag 
trp arg leu arg asp) AMB 

SEQ ID NOS: 74-75 

FIG. 4D' 



sequence upstream of seq4A * 
1/1^-SEQ ID NO: 76 
GCA ACC TAC CAG CAG AGC CAG 
(ala thr tyr gin gin ser gin 

^ — SEQ ID NO: 77 

C 



and fused with seq4A' 
31/11 

GGG CTC ACA GGA CCT AAA 
gly leu thr gly pro lys 



SEQ ID NOS: 7 6-77 

FIG. 4E' 



GGA GTA GCG CCC ATG GCT GAT 
gly val ala pro met ala asp) 



seq4J' in another reading frame 

1/1_^-SEQ ID NO: 78 31/11 

ACG CAA CCT ACC AGC AGA GCC AGG GGC TCA CAG GAC CTA AAG GAG TAG CGC CCA TGG CTG 
( thr g in pro thr ser arg ala arg gly ser gin asp leu lys glu)AMB(arg pro trp leu 

61/21" SEQ ID NO: 79 SEQ ID NO: 80 

ATC 
ile) 

SEQ ID NOS:78-80 



FIG. 4F 

seq 4J ( in the third reading frame 

1/1 SEQ ID NO: 81 31/11 

CGC AAC CTA CCA GCA GAG CCA GGG GCT CAC AGG ACC TAA AGG AGT AGC GCC CAT GGC TGA 
(arg asn leu pro ala glu pro gly ala his arg thr)OCH(arg ser ser ala his gly)OPA 
^ SEQ ID NO: 82 SEQ ID NO: 83 

TC 

SEQ ID NOS: 81-83 

FIG. 4G" 



7BSB 

lence Rv2050 predicted by Cole et al . (Nature 393:537-544) and 



containing seq4J 






























1/1^- SEQ ID 


NO: 


84 












31/11 


















ATG GCT 


GAT 


CGT 


GTC 


CTG 


AGG 


GGC 


AGT 


CGC 


CTC GGA 


GCC 


GTG 


AGC 


TAT 


GAG 


ACC 


GAC 


CGC 


(Met ala 


asp 


arg 


val 


leu 


arg 


gly 


ser 


arg 


leu gly 


ala 


val 


ser 


tyr 


glu 


thr 


asp 


arg 


61/21 


^SEQ ID NO: 


85 








91/31 










AAC CAC 


GAC 


CTG 


GCG 


CCG 


CGC 


CAG 


ATC 


GCG 


CGG TAC 


CGC 


ACC 


GAC 


AAC 


GGC 


GAG 


GAG 


TTC 


asn his 


asp 


leu 


ala 


pro 


arg 


gin 


ile 


ala 


arg tyr 


arg 


thr 


asp 


asn 


gly 


glu 


glu 


phe 


121/41 


















151/51 


















GAA GTC 


CCG 


TTC 


GCC 


GAT 


GAC 


GCC 


GAG 


ATC 


CCC GGC 


ACC 


TGG 


TTG 


TGC 


CGC 


AAC 


GGC 


ATG 


glu val 


pro 


phe 


ala 


asp 


asp 


ala 


glu 


ile 


pro gly 


thr 


trp 


leu 


cys 


arg 


asn 




met 


181/61 


















211/71 


















GAA GGC 


ACC 


CTG 


ATC 


GAG 


GGC 


GAC 


CTG 


CCC 


GAG CCG 


AAG 


AAG 


GTT 


AAG 


CCG 


CCC 


CGG 


ACG 


glu gly 


thr 


leu 


ile 


glu 


gly 


asp 


leu 


pro 


glu pro 


lys 


lys 


val 


lys 


pro 


pro 


arg 


thr 


241/81 


















271/91 


















CAC TGG 


GAC 


ATG 


CTG 


CTG 


GAG 


CGC 


CGT 


TCC 


ATC GAA 


GAA 


CTC 


GAA 


GAG 


TTA 


CTT 


AAG 


GAG 


his trp 


asp 


met 


leu 


leu 


glu 


arg 


arg 


ser 


ile glu 


glu 


leu 


glu 


glu 


leu 


leu 


lys 


glu 


301/101 


















331/111 


















CGC CTC 


GAG 


CTC 


ATT 


CGG 


TCA 


CGT 


CGG 


CGC 


GGC TGA 


















arg leu 


glu 


leu 


ile 


arg 


ser 


arg 


arg 


arg 


gly)OPA 



















SEQ XD NOS: 84-85 

FIG. 4H" 

ORF according to Cole et al . (Nature 393:537-544) and containing the 
sequence Rv2 050 

1 /1 ^ — SEQ ID NO: 86 31/11 

TAG TCC GCC CGG GTG TCC GAT CCC GGT ATC ATT GAT GGT CGC GCC GCG CGC GTC GCG TGC 
AMB ( ser ala arg val ser asp pro gly ile ile asp gly arg ala ala arg val ala cys 
61/21" SEQ ID NO: 87 91/31 

CGG GAA CTA CGC AGA CGG CCG CAG CGT TTG CCA ACC GGA GCC AGT CGC CAG TAC GCA ACC 
arg glu leu arg arg arg pro gin arg leu pro thr gly ala ser arg gin tyr ala thr 
121/41 151/51 

TAC CAG CAG AGC CCA GGG CTC ACA GGA CCT AAA GGA GTA GCG CCC ATG GCT GAT CGT GTC 
tyr gin gin ser pro gly leu thr gly pro lys gly val ala pro met ala asp arg val 
181/61 211/71 

CTG AGG GGC AGT CGC CTC GGA GCC GTG AGC TAT GAG ACC GAC CGC AAC CAC GAC CTG GCG 
leu arg gly ser arg leu gly ala val ser tyr glu thr asp arg asn his asp leu ala 
241/81 271/91 

CCG CGC CAG ATC GCG CGG TAC CGC ACC GAC AAC GGC GAG GAG TTC GAA GTC CCG TTC GCC 
pro arg gin ile ala arg tyr arg thr asp asn gly glu glu phe glu val pro phe ala 
301/101 331/111 

GAT GAC GCC GAG ATC CCC GGC ACC TGG TTG TGC CGC AAC GGC ATG GAA GGC ACC CTG ATC 
asp asp ala glu ile pro gly thr trp leu cys arg asn gly met glu gly thr leu ile 
361/121 391/131 

GAG GGC GAC CTG CCC GAG CCG AAG AAG GTT AAG CCG CCC CGG ACG CAC TGG GAC ATG CTG 
glu gly asp leu pro glu pro lys lys val lys pro pro arg thr his trp asp met leu 
421/141 451/151 

CTG GAG CGC CGT TCC ATC GAA GAA CTC GAA GAG TTA CTT AAG GAG CGC CTC GAG CTC ATT 
leu glu arg arg ser ile glu glu leu glu glu leu leu lys glu arg leu glu leu ile 
481/161 

CGG TCA CGT CGG CGC GGC TGA 
arg ser arg arg arg gly)OPA 

SEQ XD NOS 86-87 



FIG. 41' 




SEQ ID NO: 


88 














31/11 


















GAT CGC 


GGT 


CAA 


CGA 


GGC 


CGA 


ATA 


CGG 


CGA 


GAT GTG 


GGC 


CCA 


AGA 


CGC 


CGC 


CGC 


GAT 


GTT 


(asp arg 


gly 


gin 


arg 


gly 


arg 


ile 


arg 


arg 


asp val 


gly 


pro 


arg 


arg 


arg 


arg 


asp 


val 


61/237" — 


— SEQ ID 


NO: 


89 










91/31 


















TGG CTA 


CGC 


CGC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GGC GAC 


GTT 


GCT 


GCC 


GTT 


CGA 


GGA 


GGC 


GCC 


trp leu 


arg 


arg 


gly 


asp 


gly 


asp 


gly 


asp 


gly asp 


val 


ala 


ala 


val 


arg 


gly 


gly 


ala 


121/41 


















151/51 


















GGA GAT 


GAC 


CAG 


CGC 


GGG 


TGG 


GCT 


CCT 


CGA 


GCA GGC 


CGC 


CGC 


GGT 


CGA 


GGA 


GGC 


CTC 


CGA 


gly asp 


asp 


gin 


arg 


gly 


trp 


ala 


pro 


arg 


ala gly 


arg 


arg 


gly 


arg 


gly 


gly 


leu 


arg 


181/61 


















211/71 


















CAC CGC 


CGC 


GGC 


GAA 


CCA 


GTT 


GAT 


GAA 


CAA 


TGT GCC 


CCA 


GGC 


GCT 


GCA 


ACA 


GCT 


GGC 


CCA 


his arg 


arg 


gly 


glu 


pro 


val 


asp 


glu 


gin 


cys ala 


pro 


gly 


ala 


ala 


thr 


ala 


gly 


pro 


241/81 


















271/91 


















GCC CAC 


GCA 


GGG 


CAC 


CAC 


GCC 


TTC 


TTC 


CAA 


GCT GGG 


TGG 


CCT 


GTG 


GAA 


GAC 


GGT 


CTC 


GCC 


ala his 


ala 


gly 


his 


his 


ala 


phe 


phe 


gin 


ala gly 


trp 


pro 


val 


glu 


asp 


gly 


leu 


ala 


301/101 




































GCA TCG 


GTC 


GCC 


GAT 


C 




























ala ser 


val 


ala 


asp) 































SEQ ID NOSo 88-89 

FIG. 5A 



f SEQ ID NO: 


90 












32/11 


















ATC GCG 


GTC 


AAC 


GAG 


GCC 


GAA 


TAC 


GGC 


GAG 


ATG TGG 


GCC 


CAA 


GAC 


GCC 


GCC 


GCG 


ATG 


TTT 


ile ala 


val 


asn 


glu 


ala 


glu 


tyr 


gly 


glu 


met trp 


ala 


gin 


asp 


ala 


ala 


ala 


met 


phe 


62/2 1^- 


SEQ 


ID NO: 91 










92/31 


















GGC TAC 


GCC 


GCG 


GCG 


ACG 


GCG 


ACG 


GCG 


ACG 


GCG ACG 


TTG 


CTG 


CCG 


TTC 


GAG 


GAG 


GCG 


CCG 


gly tyr 


ala 


ala 


ala 


thr 


ala 


thr 


ala 


thr 


ala thr 


leu 


leu 


pro 


phe 


glu 


glu 


ala 


pro 


122/41 


















152/51 


















GAG ATG 


ACC 


AGC 


GCG 


GGT 


GGG 


CTC 


CTC 


GAG 


CAG GCC 


GCC 


GCG 


GTC 


GAG 


GAG 


GCC 


TCC 


GAC 


glu met 


thr 


ser 


ala 


gly 


gly 


leu 


leu 


glu 


gin ala 


ala 


ala 


val 


glu 


glu 


ala 


ser 


asp 


182/61 


















212/71 


















ACC GCC 


GCG 


GCG 


AAC 


CAG 


TTG 


ATG 


AAC 


AAT 


GTG CCC 


CAG 


GCG 


CTG 


CAA 


CAG 


CTG 


GCC 


CAG 


thr ala 


ala 


ala 


asn 


gin 


leu 


met 


asn 


asn 


val pro 


gin 


ala 


leu 


gin 


gin 


leu 


ala 


gin 


242/81 


















272/91 


















CCC ACG 


CAG 


GGC 


ACC 


ACG 


CCT 


TCT 


TCC 


AAG 


CTG GGT 


GGC 


CTG 


TGG 


AAG 


ACG 


GTC 


TCG 


CCG 


pro thr 


gin 


gly 


thr 


thr 


pro 


ser 


ser 


lys 


leu gly 


gly 


leu 


trp 


lys 


thr 


val 


ser 


pro 



302/101 

CAT CGG TCG CCG ATC 
his arg ser pro ile 



SEQ ID NOS. 90-91 



FIG. 5B 



^SEQ ID NO: 92 33/11 



TCG CGG 


TCA 


ACG 


AGG 


CCG 


AAT 


ACG 


GCG 


AGA 


TGT GGG 


CCC 


AAG 


ACG 


CCG 


CCG 


CGA 


TGT 


TTG 


ser (arg 


ser 


thr 


arg 


pro 


asn 


thr 


ala 


arg 


cys gly 


pro 


lys 


thr 


pro 


pro 


arg 


cys 


leu 


63/2 1 SEQ ID NO: 93 














93/31 












GCT ACG 


CCG 


CGG 


CGA 


CGG 


CGA 


CGG 


CGA 


CGG 


CGA CGT 


TGC 


TGC 


CGT 


TCG 


AGG 


AGG 


CGC 


CGG 


ala thr 


pro 


arg 


arg 


arg 


arg 


arg 


arg 


arg 


arg arg 


cys 


cys 


arg 


ser 


arg 


arg 


arg 


arg 


123/41 


















153/51 


















AGA TGA 


CCA 


GCG 


CGG 


GTG 


GGC 


TCC 


TCG 


AGC 


AGG CCG 


CCG 


CGG 


TCG 


AGG 


AGG 


CCT 


CCG 


ACA 


arg)OPA(pro 


ala 


arg 


val 


gly 


ser 


ser 


ser 


arg pro 


pro 


arg 


ser 


arg 


arg 


pro 


pro 


thr 


183/61 


— SEQ ID NO: 94 








213/71 


















CCG CCG 


CGG 


CGA 


ACC 


AGT 


TGA 


TGA 


ACA 


ATG 


TGC CCC 


AGG 


CGC 


TGC 


AAC 


AGC 


TGG 


CCC 


AGC 


pro pro 


arg 


arg 


thr 


ser )OPA 


OPA(thr 


met 


cys pro 


arg 


arg 


cys 


asn 


ser 


trp 


pro 


ser 


243/81 




SEQ ID NO: 95 






273/91 


















CCA CGC 


AGG 


GCA 


CCA 


CGC 


CTT 


CTT 


CCA 


AGC 


TGG GTG 


GCC 


TGT 


GGA 


AGA 


CGG 


TCT 


CGC 


CGC 


pro arg 


arg 


ala 


pro 


arg 


leu 


leu 


pro 


ser 


trp val 


ala 


cys 


gly 


arg 


arg 


ser 


arg 


arg 



303/101 

ATC GGT CGC CGA TC 
ile gly arg arg) 



SEQ XD NOS: 92-95 

FIG. 5C 



part of the nucleotide sequence Seq 5A 



1/l^SEQ ID NO: 96 31/11 



CGC CGC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GTT 


GCT 


GCC 


GTT 


CGA 


GGA 


GGC 


GCC 


GGA 


GAT 


arg arg 


gly 


asp 


gly 


asp 


gly 


asp 


gly 


asp 


yal 


ala 


ala 


val 


arg 


gly 


gly 


ala 


gly 


asp 


61/21^- 


SEQ 


ID NO: 


97 
















91/31 












GAC CAG 


CGC 


GGG 


TGG 


GCT 


CCT 


CGA 


GCA 


GGC 


CGC 


CGC 


GGT 


CGA 


GGA 


GGC 


CTC 


CGA 


CAC 


CGC 


asp gin 


arg 


gly 


trp 


ala 


pro 


arg 


ala 


gly 


arg 


arg 


gly 


arg 


gly 


gly 


leu 


arg 


his 


arg 


121/41 


















151/51 


















CGC . GGC 


GAA 


CCA 


GTT 


GAT 


GAA 


CAA 


TGT 


GCC 


CCA 


GGC 


GCT 


GCA 


ACA 


GCT 


GGC 


CCA 


GCC 


CAC 


arg gly 


glu 


pro 


val 


asp 


glu 


gin 


cys 


ala 


pro 


gly 


ala 


ala 


thr 


ala 


gly 


pro 


ala 


his 


181/61 


















271/71 


















GCA GGG 


CAC 


CAC 


GCC 


TTC 


TTC 


CAA 


GCT 


GGG 


TGG 


CCT 


GTG 


GAA 


GAC 


GGT 


CTC 


GCC 


GCA 


TCG 


ala gly 


his 


his 


ala 


phe 


phe 


gin 


ala 


gly 


trp 


pro 


val 


glu 


asp 


gly 


leu 


ala 


ala 


ser 



241/81 

GTC GCC GAT C 
val ala asp 



SEQ ID NOS. 96-97 



FIG. 5A' 



?6 2MB « 

4 

— SEQ ID NO: 98 

TAC GCC GCG GCG ACG GCG ACG GCG ACG GCG 
tyr ala ala ala thr ala thr ala thr ala 
61/21 SEQ ID NO 5 99 

ATG ACC AGC GCG GGT GGG CTC CTC GAG CAG 
met thr ser ala gly gly leu leu glu gin 
121/41 

GCC GCG GCG AAC CAG TTG ATG AAC AAT GTG 
ala ala ala asn gin leu met asn asn val 
181/61 

ACG CAG GGC ACC ACG CCT TCT TCC AAG CTG 
thr gin gly thr thr pro ser ser lys leu 
241/81 

CGG TCG CCG ATC 
arg ser pro ile 



31/11 

ACG TTG CTG CCG TTC GAG GAG GCG CCG GAG 
thr leu leu pro phe glu glu ala pro glu 
91/31 

GCC GCC GCG GTC GAG GAG GCC TCC GAC ACC 
ala ala ala val glu glu ala ser asp thr 
151/51 

CCC CAG GCG CTG CAA CAG CTG GCC CAG CCC 
pro gin ala leu gin gin leu ala gin pro 
211/71 

GGT GGC CTG TGG AAG ACG GTC TCG CCG CAT 
gly gly leu trp lys thr val ser pro his 



SEQ ID NOS : 98-99 

FIG. 5B' 



1 /1 ^ SEQ ID NO: 100 

ACG CCG CGG CGA CGG CGA CGG CGA CGG 
( thr pro arg arg arg arg arg arg arg 
6*1/21 SEQ ID NO: 101 

TGA CCA GCG CGG GTG GGC TCC TCG AGC 
OPA(pro ala arg val gly ser ser ser 
121/4T^-SEQ ID NO: 102 
CCG CGG CGA ACC AGT TGA TGA ACA ATG 
pro arg arg thr ser)OPA 0PA(thr met 
181/61 SEQ ID NO: 103-^ 

CGC AGG GCA CCA CGC CTT CTT CCA AGC 
arg arg ala pro arg leu leu pro ser 
241/81 

GGT CGC CGA TC 
gly arg arg) 



31/11 

CGA CGT TGC TGC CGT TCG AGG AGG CGC CGG AGA 
arg arg cys cys arg ser arg arg arg arg arg) 
91/31 

AGG CCG CCG CGG TCG AGG AGG CCT CCG ACA CCG 
arg pro pro arg ser arg arg pro pro thr pro 
151/51 

TGC CCC AGG CGC TGC AAC AGC TGG CCC AGC CCA 
cys pro arg arg cys asn ser trp pro ser pro 
211/71 

TGG GTG GCC TGT GGA AGA CGG TCT CGC CGC ATC 
trp val ala cys gly arg arg ser arg arg ile 



SEQ XD NOS: 100-103 



FIG. 5C 




ORF predicted by Cole et al . (Nature 393:537-544) and containing seq5A' 
1/1_^-SEQ ID NO: 104 31/11 

tga act gat gat tct gat age gac caa cct ctt ggg gca aaa cac ccc ggc gat cgc ggt 
OPA(thr asp asp ser asp ser asp gin pro leu gly ala lys his pro gly asp arg gly 
61/21^ SEQ ID NO: 105 91/31 

caa cga ggc cga ata egg cga gat gtg ggc cca aga cgc cgc cgc gat gtt tgg eta cgc 
gin arg gly arg ile arg arg asp val gly pro arg arg arg arg asp val trp leu arg 
121/41 151/51 

cgc ggc gac ggc gac ggc gac ggc gac gtt get gee gtt cga gga ggc gec gga gat gac 
arg gly asp gly asp gly asp gly asp val ala ala val arg gly gly ala gly asp asp 
181/61 211/71 

cag cgc ggg tgg get cct cga gca ggc cgc cgc ggt cga gga ggc etc cga cac cgc cgc 
gin arg gly trp ala pro arg ala gly arg arg gly arg gly gly leu arg his arg arg 
241/81 271/91 

ggc gaa cca gtt gat gaa caa tgt gec cca ggc get gca aca get ggc cca gee cac gca 
gly glu pro val asp glu gin cys ala pro gly ala ala thr ala gly pro ala his ala 
301/101 331/111 

ggg cac cac gec ttc ttc caa get ggg tgg cct gtg gaa gac ggt etc gee gca teg gtc 
gly his his ala phe phe gin ala gly trp pro val glu asp gly leu ala ala ser val 
361/121 391/131 

gee gat cag caa cat ggt gtc gat ggc caa caa cca cat gtc gat gac caa etc ggg tgt 
ala asp gin gin his gly val asp gly gin gin pro his val asp asp gin leu gly cys 
421/141 451/151 

gtc gat gac caa cac ctt gag etc gat gtt gaa ggg ctt tgc tec ggc ggc ggc cgc cca 
val asp asp gin his leu glu leu asp val glu gly leu cys ser gly gly gly arg pro 
481/161 511/171 

ggc cgt gca aac cgc ggc gca aaa egg ggt ccg ggc gat gag etc get ggg cag etc get 
gly arg ala asn arg gly ala lys arg gly pro gly asp glu leu ala gly gin leu ala 
541/181 571/191 

ggg ttc ttc ggg tct ggg egg tgg ggt ggc cgc caa ctt ggg teg ggc ggc etc ggt egg 
gly phe phe gly ser gly arg trp gly gly arg gin leu gly ser gly gly leu gly arg 
601/201 631/211 

ttc gtt gtc ggt gec gca ggc ctg ggc cgc ggc caa cca ggc agt cac ccc ggc ggc gcg 
phe val val gly ala ala gly leu gly arg gly gin pro gly ser his pro gly gly ala 
661/221 691/231 

ggc get gec get gac cag cct gac cag cgc cgc gga aag agg gee egg gca gat get ggg 
gly ala ala ala asp gin pro asp gin arg arg gly lys arg ala arg ala asp ala gly 
721/241 751/251 

egg get gee ggt ggg gca gat ggg cgc cag ggc egg tgg tgg get cag tgg tgt get gcg 
arg ala ala gly gly ala asp gly arg gin gly arg trp trp ala gin trp cys ala ala 
781/261 811/271 

tgt tec gee gcg ace eta tgt gat gee gca ttc tec ggc ggc egg eta gga gag ggg gcg 
cys ser ala ala thr leu cys asp ala ala phe ser gly gly arg leu gly glu gly ala 
841/281 

cag act gtc gtt att tga 
gin thr val val ile)OPA 



SEQ ID NOS: 104-105 



FIG. 5D" 




of encoding an ORF fused with SeqSA' 
1/1^- SEQ ID NO: 106 



atg gtg 


gat 


ttc 


ggg 


gcg 


tta 


cca 


ccg 


gag 


Met val 
61/21 - 


asp 


phe 


gly 


ala 


leu 


pro 


pro 


glu 


— SEQ ID 


NO: 


107 








ggt teg 


gee 


teg 


ctg 


gtg 


gee 


gcg 


get 


cag 


gly ser 


ala 


ser 


leu - 


val 


ala 


ala 


ala 


gin 


121/41 


















teg gee 


gcg 


teg 


gcg 


ttt 


cag 


teg 


gtg 


gtc 


ser ala 


ala 


ser 


ala 


phe 


gin 


ser 


val 


val 


181/61 


















teg teg 


gcg 


ggt 


ctg 


atg 


gtg 


gcg 


gcg 


gec 


ser ser 


ala 


gly 


leu 


met 


val 


ala 






ala ala 


ser 


pro 


tyr 


val 


ala 


trp 


met 


ser 


241/81 


















gcg ggg 


cag 


gec 


craci 


ctg 


acc 


gec 


gee 


cag 


ala gly 


gin 


ala 


glu 


leu 


thr 


ala 


ala 


gin 


301/101 


















gcg tat 


ggg 


ctg 


acg 


gtg 


ccc 


ccg 


ccg 


gtg 


ala tyr 


gly 


leu 


thr 


val 


pro 


pro 


pro 


val 


361/121 


















ctg ata 


gcg 


acc 


aac 


etc 


ttg 


ggg 


caa 


aac 


leu ile 


ala 


thr 


asn 


leu 


leu 


gly 


gin 


asn 


421/141 


















tac ggc 


gag 


atg 


tgg 


gec 


caa 


gac 


gec 


gec 


tyr gly 


glu 


met 


trp 


ala 


gin 


asp 


ala 


ala 


481/161 


















acg gcg 


acg 


gcg 


acg 


ttg 


ctg 


ccg 


ttc 


gag 


thr ala 


thr 


ala 


thr 


leu 


leu 


pro 


phe 


glu 


541/181 


















etc etc 


gag 


cag 


gee 


gee 


gcg 


gtc 


gag 


gag 


leu leu 


glu 


gin 


ala 


ala 


ala 


val 


glu 


glu 


601/201 


















atg aac 


aat 


gtg 


ccc 


cag 


gcg 


ctg 


caa 


cag 


met asn 


asn 


val 


pro 


gin 


ala 


leu 


gin 


gin 


661/221 


















tct tec 


aag 


ctg 


ggt 


ggc 


ctg 


tgg 


aag 


acg 


ser ser 


lys 


leu 


gly 


gly 


leu 


trp 


lys 


thr 


721/241 


















atg gtg 


teg 


atg 


gec 


aac 


aac 


cac 


atg 


teg 


met val 


ser 


met 


ala 


asn 


asn 


his 


met 


ser 


781/261 


















ace ttg 


age 


teg 


atg 


ttg 


aag 


ggc 


ttt 


get 


thr leu 


ser 


ser 


met 


leu 


lys 


gly 


phe 


ala 


841/281 


















gcg gcg 


caa 


aac 


ggg 


gtc 


egg 


gcg 


atg 


age 


ala ala 


gin 


asn 


gly 


val 


arg 


ala 


met 


ser 


901/301 


















ctg ggc 


ggt 


ggg 


gtg 


gec 


gee 


aac 


ttg 


ggt 


leu gly 


gly 


gly 


val 


ala 


ala 


asn 


leu 


gly 


961/321 


















ccg cag 


gec 


tgg 


gee 


gcg 


gee 


aac 


cag 


gca 


pro gin 


ala 


trp 


ala 


ala 


ala 


asn 


gin 


ala 


1021/341 
















ace age 


ctg 


acc 


age 


gee 


gcg 


gaa 


aga 


ggg 


thr ser 


leu 


thr 


ser 


ala 


ala 


glu 


arg 


gly 


1081/361 
















ggg cag 


atg 


ggc 


gec 


agg 


gee 


ggt 


ggt 


ggg 


gly gin 


met 


gly 


ala 


arg 


ala 


gly 


gly 


gly 


1141/381 
















ccc tat 


gtg 


atg 


ccg 


cat 


tct 


ccg 


gcg 


gee 


pro tyr 


val 


met 


pro 


his 


ser 


pro 


ala 


ala 



SEQ 



(Nature 393:537-544) and capable 



31/11 


















ate aac 


tec 


gcg 


agg 


atg 


tac 


gee 


ggc 


ccg 


ile asn 


ser 


ala 


arg 


met 


tyr 


ala 


gly 


pro 


91/31 


















atg tgg 


gac 


age 


gtg 


gcg 


agt 


gac 


ctg 


ttt 


met trp 


asp 


ser 


val 


ala 


ser 


asp 


leu 


phe 


151/51 


















tgg ggt 


ctg 


acg 


gtg 


ggg 


teg 


tgg 


ata 


ggt 


trp gly 


leu 


thr 


val. 


gly 


ser 


trp 


ile 


gly 


211/71 


















teg ccg 


tat 


gtg 


gcg 


tgg 


atg 


age 


gtc 


acc 


val thr 


















271/91 


















gtc egg 


gtt 


get 


gcg 


gcg 


gee 


tac 


gag 


acg 


val arg 


val 


ala 


ala 


ala 


ala 


tyr 


glu 


thr 


331/111 


















ate gec 


gag 


aac 


cgt 


get 


gaa 


ctg 


atg 


att 


ile ala 


glu 


asn 


arg 


ala 


glu 


leu 


met 


ile 


391/131 


















acc ccg 


gcg 


ate 


gcg 


gtc 


aac 


gag 


gee 


gaa 


thr pro 


ala 


ile 


ala 


val 


asn 


glu 


ala 


glu 


451/151 


















gcg atg 


ttt 


ggc 


tac 


gec 


gcg 


gcg 


acg 


gcg 


ala met 


phe 


gly 


tyr 


ala 


ala 


ala 


thr 


ala 


511/171 


















gag gcg 


ccg 


gag 


atg 


acc 


age 


gcg 


ggt 


ggg 


glu ala 


pro 


glu 


met 


thr 


ser 


ala 


gly 


gly 


571/191 


















gee tec 


gac 


acc 


gec 


gcg 


gcg 


aac 


cag 


ttg 


ala ser 


asp 


thr 


ala 


ala 


ala 


asn 


gin 


leu 


631/211 


















ctg gee 


cag 


ccc 


acg 


cag 


ggc 


acc 


acg 


cct 


leu ala 


gin 


pro 


thr 


gin 


gly 


thr 


thr 


pro 


691/231 


















gtc teg 


ccg 


cat 


egg 


teg 


ccg 


ate 


age 


aac 


val ser 


pro 


his 


arg 


ser 


pro 


ile 


ser 


asn 


751/251 


















atg acc 


aac 


teg 


ggt 


gtg 


teg 


atg 


acc 


aac 


met thr 


asn 


ser 


gly 


val 


ser 


met 


thr 


asn 


811/271 


















ccg gcg 


gcg 


gec 


gee 


cag 


gec 


gtg 


caa 


acc 


pro ala 


ala 


ala 


ala 


gin 


ala 


val 


gin 


thr 


871/291 


















teg ctg 


ggc 


age 


teg 


ctg 


ggt 


tct 


teg 


ggt 


ser leu 


gly 


ser 


ser 


leu 


gly 


ser 


ser 


gly 


931/311 


















egg gcg 


gee 


teg 


gtc 


ggt 


teg 


ttg 


teg 


gtg 


arg ala 


ala 


ser 


val 


gly 


ser 


leu 


ser 


val 


991/331 


















gtc acc 


ccg 


gcg 


gcg 


egg 


gcg 


ctg 


ccg 


ctg 


val thr 


pro 


ala 


ala 


arg 


ala 


leu 


pro 


leu 


1051/351 
















ccc ggg 


cag 


atg 


ctg 


ggc 


ggg 


ctg 


ccg 


gtg 


pro gly 


gin 


met 


leu 


gly 


gly 


leu 


pro 


val 


1111/371 
















etc agt 


ggt 


gtg 


ctg 


cgt 


gtt 


ccg 


ccg 


cga 


leu ser 


gly 


val 


leu 


arg 


val 


pro 


pro 


arg 


1171/391 
















ggc tag 


















gly ) AMB 



















NOS:106-107 



FIG. 5E* 



Seq 5P: ORF according to Cole et al . (Nature 393:537-544) and containing the 
sequence Rvll96 



1/1_^ SEQ ID NO: 108 31/11 

tag gga cac gta atg gtg gat ttc ggg gcg tta cca ccg gag ate aac tec gcg agg atg 
AMB(gly his val met val asp phe gly ala leu pro pro glu ile asn ser ala arg met 
61/21"* SEQ ID NO: 109 91/31 

tac gec ggc ccg ggt teg gec teg ctg gtg gee gcg get cag atg tgg gac age gtg gcg 
tyr ala gly pro gly ser ala ser leu val ala ala ala gin met trp asp ser val ala 
121/41 151/51 

agt gac ctg ttt teg gee gcg teg gcg ttt cag teg gtg gtc tgg ggt ctg acg gtg ggg 
ser asp leu phe ser ala ala ser ala phe gin ser val val trp gly leu thr val gly 
181/61 211/71 

teg tgg ata ggt teg teg gcg ggt ctg atg gtg gcg gcg gec teg ccg tat gtg gcg tgg 
ser trp ile gly ser ser ala gly leu met val ala ala ala ser pro tyr val ala trp 
241/81 271/91 

atg age gtc ace gcg ggg cag gec gag ctg ace gee gec cag gtc egg gtt get gcg gcg 
met ser val thr ala gly gin ala glu leu thr ala ala gin val arg val ala ala ala 
301/101 331/111 

gee tac gag acg gcg tat ggg ctg acg gtg ccc ccg ccg gtg ate gec gag aac cgt get 
ala tyr glu thr ala tyr gly leu thr val pro pro pro val ile ala glu asn arg ala 
361/121 391/131 

gaa ctg atg att ctg ata gcg ace aac etc ttg ggg caa aac ace ccg gcg ate gcg gtc 
glu leu met ile leu ile ala thr asn leu leu gly gin asn thr pro ala ile ala val 
421/141 451/151 

aac gag gee gaa tac ggc gag atg tgg gee caa gac gee gee gcg atg ttt ggc tac gee 
asn glu ala glu tyr gly glu met trp ala gin asp ala ala ala met phe gly tyr ala 
481/161 511/171 

gcg gcg acg gcg acg gcg acg gcg acg ttg ctg ccg ttc gag gag gcg ccg gag atg acc 
ala ala thr ala thr ala thr ala thr leu leu pro phe glu glu ala pro glu met thr 
541/181 571/191 

age gcg ggt ggg etc etc gag cag gee gee gcg gtc gag gag gee tec gac acc gee gcg 
ser ala gly gly leu leu glu gin ala ala ala val glu glu ala ser asp thr ala ala 
601/201 631/211 

gcg aac cag ttg atg aac aat gtg ccc cag gcg ctg caa cag ctg gec cag ccc acg cag 
ala asn gin leu met asn asn val pro gin ala leu gin gin leu ala gin pro thr gin 
661/221 691/231 

ggc acc acg cct tct tec aag ctg ggt ggc ctg tgg aag acg gtc teg ccg cat egg teg 
gly thr thr pro ser' ser lys leu gly gly leu trp lys thr val ser pro his arg ser 
721/241 751/251 

ccg ate age aac atg gtg teg atg gec aac aac cac atg teg atg acc aac teg ggt gtg 
pro ile ser asn met val ser met ala asn asn his met ser met thr asn ser gly val 
781/261 811/271 

teg atg acc aac acc ttg age teg atg ttg aag ggc ttt get ccg gcg gcg gee gee cag 
ser met thr asn thr leu ser ser met leu lys gly phe ala pro ala ala ala ala gin 
841/281 871/291 

gec gtg caa acc gcg gcg caa aac ggg gtc egg gcg atg age teg ctg ggc age teg ctg 
ala val gin thr ala ala gin asn gly val arg ala met ser ser leu gly ser ser leu 
901/301 931/311 

ggt tct teg ggt ctg ggc ggt ggg gtg gee gee aac ttg ggt egg gcg gee teg gtc ggt 
gly ser ser gly leu gly gly gly val ala ala asn leu gly arg ala ala ser val gly 
961/321 991/331 

teg ttg teg gtg ccg cag gee tgg gec gcg gee aac cag gca gtc acc ccg gcg gcg egg 
ser leu ser val pro gin ala trp ala ala ala asn gin ala val thr pro ala ala arg 
1021/341 1051/351 

gcg ctg ccg ctg acc age ctg acc age gec gcg gaa aga ggg ccc ggg cag atg ctg ggc 
ala leu pro leu thr ser leu thr ser ala ala glu arg gly pro gly gin met leu gly 
1081/361 1111/371 

ggg ctg ccg gtg ggg cag atg ggc gee agg gec ggt ggt ggg etc agt ggt gtg ctg cgt 
gly leu pro val gly gin met gly ala arg ala gly gly gly leu ser gly val leu arg 
1141/381 1171/391 

gtt ccg ccg cga ccc tat gtg atg ccg cat tct ccg gcg gee ggc tag 
val pro pro arg pro tyr val met pro his ser pro ala ala gly)AMB 



SEQ ID NOS: 108-109 



FIG. 5P 



MAR 2 6 2003 



SEQ ID 

GGA TCC TGA 
gly ser OPA 
61/21 

TTG GTG CAT 

leu vaT^his 

121/41 

CTT GCC CAC 

leu ala his 

181/61 

GCG GTG CCT 

ala val pro 

241/81 

TCA CCA ACG 

ser pro thr 

301/101 

GCT GTA CCC 

ala val pro 

361/121 

GCA GCC GAG 

ala ala glu 

421/141 

GAA GGA GAA 

glu gly glu 



NO: 110 

TGC AAG TGG 
(cys lys trp 
V SEQ ID NO: 

CCG GGC TGC 
pro gly cys 



TGT 
cys 

GGG 
gly 

TCG 
ser 

TCA 
ser 



TCC 
ser 

CAT 
his 

AAA 
lys 

CCG 
pro 



TCC 
ser 

CCA 
pro 

CCG 
pro 

AAC 
asn 



CTC AAG GTG 
leu lys val 

GAT C 
asp) 



TCC GGG 
ser gly 

111 
GAG CAT 
glu his 

CTG CCG 
leu pro 

GCA AAA 
ala lys 

ACG GTT 
thr val 

GTG AGT 
val ser 

TCC CGC 
ser arg 



ATT 
ile 

GCA 
ala 

GCA 
ala 

CGG 
arg 

GCC 
ala 

CCC 
pro 

ACC 
thr 



TGT 
cys 

CGC 
arg 

CCA 
pro 

GGA 

gly 

AGC 
ser 

ACA 
thr 

ACT 
thr 



31/11 
CGG CAG 
arg gin 
91/31 
ACC GAC 
thr asp 
151/51 
TGT GCG 
cys ala 
211/71 
GCT CAA 
ala gin 
271/91 
CGG CCC 
arg pro 
331/111 
CCG CGG 
pro arg 
391/131 
GTC TCG 
val ser 







CGG 


TCC 


CGT 


CGA 


CCA 


ACG 


pro 


arg 


arg 


ser 


arg 


arg 


pro 


thr 






GGC 


GAG 


i~\ r-» 


GGC 


TAG 


CTG 


gin 


arg 


gly 


glu 


arg 


gly)AMB(leu 






SEQ ID 


NO: 


112 








TTA 


AGC 


CjLA 


GCA 


GTA 


CCG 


thr 


ser 


leu 


ser 


ala 


ala 


val 


pro 


PA 7\ 




TTC 


ATG 


A A /~* 


GAG 


GGG 


TCG 




arg 


phe 


met 


asn 


glu 


gly 


sec 


ACG 


ATA 


TTG 


CGT 


GCT 


CGA 


GGG 


TCC 


thr 


ile 


leu 


arg 


ala 


arg 


gly 


ser 


AGG 


CGG 


GCG 


ACT 


CTG 


GCG 


TCG 


TTA 


arg 


arg 


ala 


thr 


leu 


ala 


ser 


leu 


AAT 


GCT 


TTT 


AAC 


CGA 


CCG 


GAT 


CCA 


asn 


ala 


phe 


asn 


arg 


pro 


asp 


pro 



SEQ ID NC 

FIG 



^SEQ ID NO: 


113 












GAT CCT 


GAT 


GCA 


AGT 


GGT 


CCG 


GGA 


TTT 


GTC 


(asp pro 


asp 


ala 


ser 


gly 


pro 


gly 


phe 


val 


62/21 ""^ 


— SEQ ID 


NO: 


114 








TGG TGC 


ATC 


CGG 


GCT 


GCG 


AGC 


ATG 


CAC 


GCA 


trp cys 


ile 


arg 


ala 


ala 


ser 


met 


his 


ala 


122/41 


















TTG CCC 


ACT 


GTT 


CCT 


CCC 


TGC 


CGG 


CAC 


CAT 


leu pro 


thr 


val 


pro 


pro 


cys 


arg 


his 


his 


182/61 


















CGG TGC 


CTG 


GGC 


ATC 


CAG 


CAA 


AAC 


GGG 


GAG 


arg cys 


leu 


gly 


ile 


gin 


gin 


asn 


gly 


glu 


242/81 


















CAC CAA 


CGT 


CGA 


AAC 


CGA 


CGG 


TTG 


CCA 


GCC 


his gin 


arg 


arg 


asn 


arg 


arg 


leu 


pro 


ala 


302/101 


















CTG TAC 


CCT 


CAC 


CGA 


ACG 


TGA 


GTC 


CCA 


CAC 


leu tyr 


pro 


his 


arg 


thr) OPA (val 


pro 


his 


362/121 


SEQ ID NO: 


117 * 








CAG CCG 


AGC 


TCA 


AGG 


TGT 


CCC 


GCA 


CCA 


CTG 


(gin pro 


ser 


ser 


arg 


cys 


pro 


ala 


pro 


leu 


422/141^ 


■ SEQ ID NO: 


118 








AAG GAG 


AAG 


ATC 














lys glu 


lys 


ile) 













SEQ ID 



►SsllO-112 

. 6A 



32/11 

GGC AGC CAC GGC GGT CCC GTC GAC CAA CGT 
gly ser his gly gly pro val asp gin arg 
92/31 

CCG ACC AGC GCG GCG AGC GCG GCT AGC TGC 
pro thr ser ala ala ser ala ala ser cys 
152/51 

GTG CGA CAA GCT TAA GCG CAG CAG TAC CGG 
val arg gin ala)OCH(ala gin gin tyr arg 
212/71 ^SEQ ID NO: 115 

CTC AAG AAC GAT TCA TGA ACG AGG GGT CGT 
leu lys asn asp ser) OPA (thr arg gly arg 
272/91 SEQ ID NO: 116—*' 

GGC CCA CGA TAT TGC GTG CTC GAG GGT CCG 
gly pro arg tyr cys val leu glu gly pro 
332/111 

CGC GGA GGC GGG CGA CTC TGG CGT CGT TAG 
arg gly gly gly arg leu trp arg arg)AMB 
392/131 

TCT CGA ATG CTT TTA ACC GAC CGG ATC CAG 
ser arg met leu leu thr asp arg ile gin 



5 113-118 



FIG. 6B 




ATC CTG 


ATG 


CAA 


GTG 


GTC 


CGG 


GAT 


TTG 


TCG 


GCA GCC 


ACG 


GCG 


GTC 


CCG 


TCG 


ACC 


AAC 


GTT 


(ile leu 


met 


gin 


val 


val 


arg 


asp 


leu 


ser 


ala ala 


thr 


ala 


val 


pro 


ser 


thr 


asn 


val 


63/21 


- SEQ ID 


NO: 


120 








93/31 


















GGT GCA 




\J\J\J 


CTG 






TGC 


ACG 


CAC 


CGA CCA 


GCG 


CGG 


CGA 


GCG 


CGG 


CTA 


GCT 


GCT 


gly ala 


ser 


gly 


leu 


arg 


ala 


cys 


thr 


his 


arg pro 


ala 


arg 


arg 


ala 


arg 


leu 


ala 


ala 


123/41 


















153/51 


















TGC CCA 






CTC 


V, 1 


err* 


GGC 


ACC 


ATG 


TGC GAC 


AAG 


CTT 


AAG 


CGC 


AGC 


AGT 


ACL 




cys pro 


leu 


phe 


leu 


pro 


a J. a 


gly 


thr 


met 


cys asp 


lys 


leu 


lys 


arg 


scr 


ser 


thr 


gly 


183/61 


















213/71 


















GGT GCC 


1 GG 


GCA 


TCC 


AGC 


AAA 

AAA 


ACG 


GGG 


AGC 


TCA AGA 


ACG 


ATT 


CAT 


GAA 


CGA 


GGG 


GTC 


GTC 


gly ala 


trp 


ala 


ser 


ser 


lys 


thr 


gly 


ser 


ser arg 


thr 


ile 


his 


glu 


arg 


gly 


val 


val 


243/81 


















273/91 


















ACC AAC 


GTC 


GAA 


ACC 


GAC 


GGT 


TGC 


CAG 


CCG 


GCC CAC 


GAT 


ATT 


GCG 


TGC 


TCG 


AGG 


GTC 


CGC 


thr asn 


vd J. 


glu 


thr 


asp 


gly 


cys 


gin 


pro 


ala his 


asp 


ile 


ala 


cys 


ser 


arg 


val 


arg 


303 /101 


















333/111 


















TGT ACC 


CTC 


ACC 


GAA 


CGT 


GAG 


TCC 


CAC 


ACC 


GCG GAG 


GCG 


GGC 


GAC 


TCT 


GGC 


GTC 


GTT 


AGC 


cys thr 


leu 


thr 


giu 


arg 


glu 


ser 


nis 


thr 


ala glu 


ala 


gly 


asp 


ser 


gly 


val 


val 


ser 


363/121 


















393/131 


















AGC CGA 


GCT 


CAA 


GGT 


GTC 


CCG 


CAC 


CAC 


TGT 


CTC GAA 


TGC 


TTT 


TAA 


CCG 


ACC 


GGA 


TCC 


AGA 


ser arg 


ala 


gin 


gly 


val 


pro 


his 


his 


cys 


leu glu 


cys 


phe)OCH(pro 


thr 


gly 


ser 


arg 


423/141 
















SEQ ID NO: 


: 121 












AGG AGA 


AGA 


TC 
































arg arg 


arg) 














































SEQ XD NOS:119- 


121 






























FIG. 6C 


















_^~SEQ ID NO: 122 












31/11 


















CCG TCG 


GCA 


ACT 


TGG 






AGG 


TCG 


GCT 


TGA TCC 


CTG 


GGC 


CGA 


GGC 


GGG 


TCA 


GCC 


AAT 


(pro ser 


ala 


thr 


trp 


pro 


leu 


arg 


ser 


ala)OPA(ser 


leu 


gly 


arg 


gly 


gly 


ser 


ala 


asn 


61721^-- 


SEQ 


ID NO: 












91/31^- 


-SEQ 


ID NO: 


124 










AGC GGC 


TCC 


ATC 


GGC 


TTT 




GGT 


AGC 


GGT 


TCG GCG 


GGA 


AGC 


TAG 


CGG 


CGA 


CGT 


TGT 


CGG 


ser gly 


ser 


ile 


gly 


phe 


ala 


gly 


ser 


gly 


ser ala 


gly 


ser) AMB (arg 


arg 


arg 


cys 


arg 


121/41 


















151/51 








V SEQ ID 


NO : 


125 


TGG CCG 


GTG 


ATA 


TAT 






AGA 


CGG 


GTA 


TGG CGG 


CGG 


CTG 


AGG 


TGA 


TCT 


GCG 


ACA 


CGC 


trp pro 


val 


ile 


tyr 


trp 


val 


arg 


arg 


val 


trp arg 


arg 


leu 


arg)OPA(ser 


ala 


thr 


arg 


181/61 


















211/71 


SEQ 


ID NO: 


126 










CGC CGC 


GGT 


GCT 


CGA 


GCC 


AGG 


CTT 


ACG 


ACC 


AGG GAA 


TTT 


CGA 


AAA 


TGT 


TAT 


TCA 


GAA 


CAT 


arg arg 


gly 


ala 


arg 


ala 


arg 


leu 


thr 


thr 


arg glu 


phe 


arg 


lys 


cys 


tyr 


ser 


glu 


his 


241/81 


















271/91 


















CTT GTA 


TCT 


CTT 


CTC 


CGT 


GCC 


ACC 


CCC 


TAG 


GTG TAG 


TGT 


TTT 


CGA 


GTA 


CCG 


GCA 


GAT 


CCC 


leu val 


ser 


leu 


leu 


arg 


ala 


thr 


pro) AMB 


val AMB ( cys 


phe 


arg 


val 


pro 


ala 


asp 


pro 


301/101 






















— SEQ ID 


NO: 


127 






AGG TTC 


ACC 


AGG 


TCT 


CAC 


CAG 


ATC 
























arg phe 


thr 


arg 


ser 


his 


gin 


ile) 























SEQ ID NOS: 122-127 



FIG. 7A 



-SEQ ID NO: 128 32/11 



CGT CGG 


CAA 


CTT 


GGC 


CGC 


TGA 


GGT 


CGG 


CTT 


GAT CCC 


TGG 


GCC 


GAG 


GCG 


GGT 


CAG 


CCA 


ATA 


(arg arg 


gin 


, leu 


gly 


arg)OPA(gly 


arg 


leu 


asp pro 


trp 


ala 


glu 


ala 


gly 


gin 


pro 


ile 


62/21 SEQ ID NO: 


129 










92/ 31^ SEQ ID NO: 130 










GCG GCT 


CCA 


TCG 


GCT 


TTG 


CTG 


GTA 


GCG 


GTT 


CGG CGG 


GAA 


GCT 


AGC 


GGC 


GAC 


GTT 


GTC 


GGT 


- ala ala 

122/41 


pro 


ser 


ala 


leu 


leu 


val 


ala 


val 


arg arg 
152/51 


glu 


ala 


ser 


gly 


asp 


val 


val 


gly 


GGC CGG 


TGA 


TAT 


ATT 


GGG 


TCA 


GAC 


GGG 


TAT 


GGC GGC 


GGC 


TGA 


GGT 


GAT 


CTG 


CGA 


CAC 


GCC 


gly arg)OPA(tyr 
182/61 *SEQ 


ile gly 
ID NO: 


ser 
131 


asp 


gly 


tyr 


gly gly 
212/71 


gly)OPA(gly asp leu 
^SEQ ID NO: 


arg his 
132 


ala 


GCC GCG 


GTG 


CTC 


GAG 


CCA 


GGC 


TTA 


CGA 


CCA 


GGG AAT 


TTC 


GAA 


AAT 


GTT 


ATT 


CAG 


AAC 


ATC 


ala ala 
242/81 


val 


leu 


glu 


pro 


gly 


leu 


arg 


pro 


gly asn 
272/91 


phe 


glu 


asn 


val 


ile 


gin 


asn 


ile 


TTG TAT 


CTC 


TTC 


TCC 


GTG 


CCA 


CCC 


CCT 


AGG 


TGT AGT 


GTT 


TTC 


GAG 


TAC 


CGG 


CAG 


ATC 


CCA 


leu tyr 
302/101 


leu 


phe 


ser 


val 


pro 


pro 


pro 


arg 


cys ser 


val 


phe 


glu 


tyr 


arg 


gin 


ile 


pro 


GGT TCA 


CCA 


GGT 


CTC 


ACC 


AGA 


TC 
























gly ser 


pro 


gly 


leu 


thr 


arg) 



























SEQ ID NOS: 128-132 

FIG. 7B 



^SEQ ID NO: 133 33/11 

GTC GGC AAC TTG GCC GCT GAG GTC GGC TTG ATC CCT GGG CCG AGG CGG GTC AGC CAA TAG 

( val g ly asn leu ala ala glu val gly leu ile pro gly pro arq arg val ser gin ) AMB 

63/2 1~-~^ SEQ ID NO : 134 93/31 

CGG CTC CAT CGG CTT TGC TGG TAG CGG TTC GGC GGG AAG CTA GCG GCG ACG TTG TCG GTG 
( arg leu his arg leu cys trp) AMB (arg ph e gly gly lys leu ala ala thr leu ser val 

12 3/41 SEQ ID NO: 135 ^ " 153/51 —SEQ ID NO: 136 

GCC GGT GAT ATA TTG GGT CAG ACG GGT ATG GCG GCG GCT GAG GTG ATC TGC GAC ACG CCG 

ala gly asp ile leu gly gin thr gly met ala ala ala glu val ile cys asp thr pro 

183/61 213/71 

CCG CGG TGC TCG AGC CAG GCT TAC GAC CAG GGA ATT TCG AAA ATG TTA TTC AGA ACA TCT 

pro arg cys ser ser gin ala tyr asp gin gly ile ser lys met leu phe arg thr ser 

243/81 273/91 

TGT ATC TCT TCT CCG TGC CAC CCC CTA GGT GTA GTG TTT TCG AGT ACC GGC AGA TCC CAG 

cys ile ser ser pro cys his pro leu gly val val phe ser ser thr gly arg ser gin 
303/101 

GTT CAC CAG GTC TCA CCA GAT C 
val his gin val ser pro asp) 



SEQ ID NOS: 133-136 



FIG. 7C 



_Q ID NO: 137 

TGC GTG ATG TCC AAT GGC 
( leu _c ys val met ser asn gly 

61/21^ SEQ ID NO: 138 

TAG TTT TCG CGA TGA CGC TCG 
AMB phe ser arg OPA arg ser 
121/41 

AGC GAA CCG TCG ATT CCA TGG 
ser glu pro ser ile pro trp 
181/61 

CGT TCC TGG GTT CGA CAC CAT 
arg ser trp val arg his his 
241/81 

GCG TCT TGG CTG CCG CCC CAC 
ala ser trp leu pro pro his) 
301/101 SEQ ID NO: 273- 

CGC GAA ACG TCA CCG CGT TCG 
arg glu thr ser pro arg ser 
361/121 

ACG GTC GGG CGC CAT CGA CGC 
thr val gly arg his arg arg 
421/141 

TCG GCG ACG ATC 
ser ala thr ile) 



GAA AAC 
glu asn 

TTC TGA 
phe OPA 

GTG TCG 
val ser 

TCG CCC 
ser pro 

TAG CGA 
AMB (arg 



GGG CAC 
gly his 

CGG ACG 
arg thr 



GAC 
asp 

CCG 
pro 

ACG 
thr 

AAA 
lys 

CTG 
leu 

CAG 
gin 

AGG 
arg 



31/11 
GCC TTG 
ala leu 
91/31 
GAC TTG 
asp leu 
151/51 
CAT TCG 
his ser 
211/71 
TCG ACC 
ser thr 
271/91 
CGC CGT 
arg arg 
331/111 
AGC ACG 
ser thr 
391/131 
TCG CGG 
ser arg 



TCA TCG CAA TCG TCA GCA 
ser ser gin ser ser ala 

TGA ACG GGT TTC GGG TCG 
OPA (thr gly phe gly ser 
1 SEQ ID NO: 272 

TGG TCA AGG CCG GCG CGG 
trp ser arg pro ala arg 



TGC 
cys 

CGA 
arg 

GAC 
asp 

TGT 
cys 



CCC 
pro 

CGA 
arg 

CCG 
pro 

CGA 
arg 



AGG 
arg 

TCC 
ser 

GCA 
ala 

GCA 
ala 



TTG 
leu 

GGC 
gly 



CTC 
leu 

AGG 
arg 



GTG 
val 

GCA 
ala 



CCG GCC 
pro ala) 

AGG CCG 
arg pro 

CAG GAC 
gin asp 

CGC CTG 
arg leu 

CGT CAG 
arg gin 



TGC 


CGC 


GGG 


TCT 


CGG 


cys 


arg 


gly 


ser 


arg 


CGC 


TGG 


GCC 


GAA 


ACC 


arg 


trp 


ala 


glu 


thr 



SEQ ID NOS:137-138, 272-273 

FIG. 8A 



-SEQ ID NO: 139 

TTT GCG TGA TGT CCA ATG GCG AAA ACG ACG 

phe ala OPA (cys pro met ala lys thr thr 

62/21 t — SEQ ID NO: 140 

AGT TTT CGC GAT GAC GCT CGT TCT GAC CGG 

ser phe arg asp asp ala arg ser asp arg 

122/41 

GCG AAC CGT CGA TTC CAT GGG TGT CGA CGC 
ala asn arg arg phe his gly cys arg arg 
182/61 

GTT CCT GGG TTC GAC ACC ATT CGC CCA AAT 
val pro gly phe asp thr ile arg pro asn 
242/81 

CGT CTT GGC TGC CGC CCC ACT AGC GAC TGC 
arg leu gly cys arg pro thr ser asp cys 
302/101 

GCG AAA CGT CAC CGC GTT CGG GGC ACC AGA 
ala lys arg his arg val arg gly thr arg 
362/121 

CGG TCG GGC GCC ATC GAC GCC GGA CGA GGT 
arg ser gly ala ile asp ala gly arg gly 
422/141 

CGG CGA CGA TC 
arg arg arg) 

SEQ ID N( 



32/11 

CCT TGT CAT CGC AAT CGT CAG CAC CGG CCT 
pro cys his arg asn arg gin his arg pro 
92/31 

ACT TGT GAA CGG GTT TCG GGT CGA GGC CGA 
thr cys glu arg val ser gly arg gly arg 
152/51 

ATT CGT GGT CAA GGC CGG CGC GGC AGG ACC 
ile arg gly gin gly arg arg gly arg thr 
212/71 

CGA CCT GCC CCA GGT TGC TCG TGC GCC TGG 
arg pro ala pro gly cys ser cys ala trp 
272/91 

GCC GTC GAC GAT CCG GCA GGG CAC GTC AGC 
ala val asp asp pro ala gly his val ser 
332/111 

GCA CGG ACC CGG CAT GCC GCG GGT CTC GGA 
ala arg thr arg his ala ala gly leu gly 
392/131 

CGC GGT GTC GAG CAC GCT GGG CCG AAA CCT 
arg gly val glu his ala gly pro lys pro 



: 139-140 



FIG. 8B 




^ SEQ ID NO: 141 












33/11 


















ittg cgt 


GAT 


GTC 


CAA 


TGG 


CGA 


AAA 


CGA 


CGC 


CTT GTC 


ATC 


GCA 


ATC 


GTC 


AGC 


ACC 


GGC 


CTA 


(leu arg 


asp 


val 


gin 


trp 


arg 


lys 


arg 


arg 


leu val 


ile 


ala 


ile 


val 


ser 


thr 


gly 


leu 


63/21 - 


— SEQ ID 


NO: 


142 








93/31 


















GTT TTC 


GCG 


ATG 


ACG 


CTC 


GTT 


CTG 


ACC 


GGA 


CTT GTG 


AAC 


GGG 


TTT 


CGG 


GTC 


GAG 


GCC 


GAG 


val phe. a la 


met 


thr 


leu 


val 


leu 


thr 


crl v 


leu val 


asn 


cr 1 v 


phe 


arg 


val 


glu 


ala 


glu 


123/41 


















153/51 


















CGA ACC 


GTC 


GAT 


TCC 


ATG 


GGT 


GTC 


GAC 


GCA 


TTC GTG 


GTC 


AAG 


GCC 


GGC 


GCG 


GCA 


GGA 


CCG 


.arg thr 


val 


asp 


ser 


met 


gly 


val 


asp 


ala 


phe val 


val 


lvs 


ala 


cr 1 v 


ala 


ala 


crlv 


pro 


183/61 


















213/71 


















TTC CTG 


GGT 


TCG 


ACA 


CCA 


TTC 


GCC 


CAA 


ATC 


GAC CTG 


CCC 


CAG 


GTT 


GCT 


CGT 


GCG 


CCT 


GGC 


phe leu 


gly 


ser 


thr 


pro 


phe 


ala 


crln 


ile 


asp leu 


nro 


crln 


val 


ala 


arcf 


ala 


pro 


crlv 


243/81 


















273/91 


















GTC TTG 


GCT 


GCC 


GCC 


CCA 


CTA 


GCG 


ACT 


GCG 


CCG TCG 


ACG 


ATC 


CGG 


CAG 


GGC 


ACG 


TCA 


GCG 


val leu 


ala 


ala 


ala 


pro 


leu 


ala 


thr 


ala 


pro ser 


thr 


ile 




crln 


crlv 


thr 


ser 


ala 


303/101 


















333/111 


















CGA AAC 


GTC 


ACC 


GCG 


TTC 


GGG 


GCA 


CCA 


GAG 


CAC GGA 


CCC 


GGC 


ATG 


CCG 


CGG 


GTC 


TCG 


GAC 


arg asn 


val 


thr 


ala 


phe 


gly 


ala 


pro 


glu 


his gly 


pro 


gly 


met 


pro 


arg 


val 


ser 


asp 


363/121 


















393/131 


















GGT CGG 


GCG 


CCA 


TCG 


ACG 


CCG 


GAC 


GAG 


GTC 


GCG GTG 


TCG 


AGC 


ACG 


CTG 


GGC 


CGA 


AAC 


CTC 


gly arg 


ala 


pro 


ser 


thr 


pro 


asp 


glu 


val 


ala val 


ser 


ser 


thr 


leu 


gly 


arg 


asn 


leu 


423/141 




































GGC GAC 


GAT 


C 
































gly asp 


asp) 



































SEQ ZD NOS: 141-142 

FIG. 8C 



part of 


the 


nucleotide sequence 


of seq8A 


















1/1^ SEQ ID 


NO: 


143 






31/11 


















r CAG GTT 


GCT 


CGT 


GCG CCT GGC 


GTC 


TTG 


GCT GCC GCC 


CCA 


CTA 


GCG 


ACT 


GCG 


CCG 


TCG 


ACG 


(gin val 


ala 


arg 


ala pro gly 


val 


leu 


ala ala ala 


pro 


leu 


ala 


thr 


ala 


pro 


ser 


thr 


61/2jT^- 


SEQ ID NO: 144 






91/31 


















ATC CGG 


CAG 


GGC 


ACG TCA GCG 


CGA 


AAC 


GTC ACC GCG 


TTC 


GGG 


GCA 


CCA 


GAG 


CAC 


GGA 


CCC 


ile arg 


gin 


gly 


thr ser ala 


arg 


asn 


val thr ala 


phe 


gly 


ala 


pro 


glu 


his 


gly 


pro 


121/41 












151/51 


















GGC ATG 


CCG 


CGG 


GTC TCG GAC 


GGT 


CGG 


GCG CCA TCG 


ACG 


CCG 


GAC 


GAG 


GTC 


GCG 


GTG 


TCG 


gly met 


pro 


arg 


val ser asp 


gly 


arg 


ala pro ser 


thr 


pro 


asp 


glu 


val 


ala 


val 


ser 


181/61 






























AGC ACG 


CTG 


GGC 


CGA AAC CTC 


GGC 


GAC 


GAT C 


















ser thr 


leu 


gly 


arg asn leu 


gly 


asp 


asp) 



















SEQ XD NOS. 143-144 



FIG. 8A' 




1 / 1^ ^-— SEQ ID NO: 145 31/11 

TCT TGG CTG CCG CCC CAC TAG CGA CTG CGC CGT CGA CGA 
ser trp leu pro pro his)AMB(arg leu arg arg arg arg 

91/31 ^ SEQ ID NO: 147 

GAA ACG TCA CCG CGT TCG GGG CAC CAG AGC ACG GAC CCG 
glu thr ser pro arg ser gly his gin ser thr asp pro 
151/51 

GTC GGG CGC CAT CGA CGC CGG ACG AGG TCG CGG TGT CGA 
val gly arg his arg arg arg thr arg ser arg cys arg 

GCG ACG ATC 
ala thr ile) 



AGG TTG CTC GTG CGC CTG GCG 
(arg__leu leu val arg leu ala 

61/21* SEQ ID NO: 146 

TCC GGC AGG GCA CGT CAG CGC 
ser gly arg ala arg gin arg 
121/41 

GCA TGC CGC GGG TCT CGG ACG 
ala cys arg gly ser arg thr 
181/61 

GCA CGC TGG GCC GAA ACC TCG 
ala arg trp ala glu thr ser 



SEQ XD NOS: 145-147 

FIG. 8B" 



Seq8C 

1/1 -~-SEQ ID NO: 148 31/11 

CCA GGT TGC TCG TGC GCC TGG CGT CTT GGC TGC CGC CCC ACT AGC GAC TGC GCC GTC GAC 

( pro gl y cys ser cys ala trp arg leu gly cys arg pro thr ser asp cys ala val asp 
efl721 SEQ ID NO: 149 91/31 

GAT CCG GCA GGG CAC GTC AGC GCG AAA CGT CAC CGC GTT CGG GGC ACC AGA GCA CGG ACC 

asp pro ala gly his val ser ala lys arg his arg val arg gly thr arg ala arg thr 
121/41 151/51 

CGG CAT GCC GCG GGT CTC GGA CGG TCG GGC GCC ATC GAC GCC GGA CGA GGT CGC GGT GTC 

arg his ala ala gly leu gly arg ser gly ala ile asp ala gly arg gly arg gly val 
181/61 

GAG CAC GCT GGG CCG AAA CCT CGG CGA CGA TC 

glu his ala gly pro lys pro arg arg arg) 



SEQ ID NOS: 148-149 



FIG. 8C" 



nee Rv2563 predicted by Cole et al . 
ntaining seq8A' 



(Nature 393:537-544) and 



atg 
met 

121/41 ^SEQ ID NO: 150 

ctt ttt gcg get ttg cgt gat 
( leu phe ala ala leu arg asp 
181/61 ^"SEQ ID NO: 151 
age ace ggc eta gtt ttc gcg 
ser thr gly leu val phe ala 
241/81 

gtc gag gee gag cga acc gtc 
val glu ala glu arg thr val 
301/101 

gcg gca gga ccg ttc ctg ggt 
ala ala gly pro phe leu gly 
361/121 

cgt gcg cct ggc gtc ttg get 
arg ala pro gly val leu ala 
421/141 

ggc acg tea gcg cga aac gtc 
gly thr ser ala arg asn val 
481/161 

egg gtc teg gac ggt egg gcg 
arg val ser asp gly arg ala 
541/181 

ggc cga aac etc ggc gac gat 
gly arg asn leu gly asp asp 
601/201 

gtg ccc gag tea acc gcg ctg 
val pro glu ser thr ala leu 
661/221 

cag cag ttg gca tac aac gga 
gin gin leu ala tyr asn gly 
721/241 

cga cag etc ccg gac ggc tat 
arg gin leu pro asp gly tyr 
781/261 

cgc ccg ttg aag gtc gcg gtg 
arg pro leu lys val ala val 
841/281 

gcg gcg ttg ate gtc ggc teg 
ala ala leu ile val gly ser 
901/301 

gcg gtg ttc aag gcg ate ggc 
ala val phe lys ala ile gly 
961/321 

gcg gtc gtc gtc gcg ctg etc 
ala val val val ala leu leu 
1021/341 

ccg ttg ttc ccg atg act gtc 
pro leu phe pro met thr val 
1081/361 

gcg act gtg ate ggt ctg ctg 
ala thr val ile gly leu leu 
1141/381 

ccg gca eta gcg ttc gga ggt 
pro ala leu ala phe gly gly 









151 / 51 




y 


eaa 




uyo acta 


t~* n 
^— y a. 


va 1 


gin 


trn 


arg lys 


arg 








iill/ /X 




atg 


acg 


etc 


y i_ L- v_- 1_- y 


ace 


met 


thr 


T pi i 

X Li. 


V d X xcu 


thr 








^ / X / -7 X 






ccc 


«iy 


yy<- yt-<~ 


(~X p\ c> 

y ul 


asp 


ser 


me t 


^ y v 


asp 








331/111 




^y 


aea 


eca 


l» i> *v y v.- 


eaa 


ser 


thr 


pro 




gin 








391/131 




gee 


gec 


eca 


eta gcg 


act 


ala 


ala 


pro 


leu ala 


thr 








*± J X / X J X 




ace 


rrrrr 




yyy y Lcl 


eca 


thr 


ala 


phe 


rrl v a ~\ a 


pro 








511/ 17 1 




c c a 


teg 


ac g 


• y y ci *w 


y ciy 


pro 


ser 


thr 




glu 








R *7 1 / 1 Q 1 
J / X / X -7 X 




ctg 


caa 


gtg 




cgc 


1 eu 


y xn 


val 


y xy clxcl 


=a y- (~r 
CLX y 








/Oil 
O -Z> X / ^ X X 




gca 


aag 


.a i- t- 


^3 ^3 /""« 

t^- Cta^ 


ate 


CllCl 


xy&> 


J — LC 


i^x w d Oil 


XXV 








U J J. / ^ J X 




cag 


ccg 


aea 


ate acrt 


teg 


gin 


pro 


thr 


ile ser 


ser 








751/251 




cag 


ace 


gtc 


aat cga 


crccr 


crln 


thr 


val 


asn arg 


ala 








811/271 




crat 


crccr 


ate 


acg gtt 


gtg 


asp 


ala 


ile 


thr val 


val 








871/291 




a tcr 


crtc 


tac 


etc tct 


gcg 


val 


val 


tyr 


leu ser 


ala 








931/311 




gtg 


ccg 


acg 


cgc teg 


att 


val 


pro 


thr 


arg ser 


ile 








991/331 




gcg 


gcg 


gtg 


gtt ggc 


ggc 


ala 


ala 


val 


val gly 


gly 








1051/351 


gtg 


gta 


ccc 


ctg agt 


gee 


val 


val 


pro 


leu ser 


ala 








1111/371 


gee 


age 


gtc 


gca gga 


ctg 


ala 


ser 


val 


ala gly 


leu 



ccc tag 
pro) AMB 



cgc ctt gtc ate gca ate gtc 
arg leu val ile ala ile val 

gga ctt gtg aac ggg ttt egg 
gly leu val asn gly phe arg 

gca ttc gtg gtc aag gec ggc 
ala phe val val lys ala gly 

ate gac ctg ccc cag gtt get 
ile asp leu pro gin val ala 

gcg ccg teg acg ate egg cag 
ala pro ser thr ile arg gin 

gag cac gga ccc ggc atg ccg 
glu his gly pro gly met pro 

gtc gcg gtg teg age acg ctg 
val ala val ser ser thr leu 

act ttg egg ate gtc ggc ate 
thr leu arg ile val gly ile 

ttc ctg acc acc gaa ggc eta 
phe leu thr thr glu gly leu 

ate ggg ate gac ggg atg ccc 
ile gly ile asp gly met pro 

gat get gtc age gat ctg atg 
asp ala val ser asp leu met 

gcg gtc ttg ctg tgg ate gtt 
ala val leu leu trp ile val 

ttg gag egg ctg cgt gac ttt 
leu glu arg leu arg asp phe 

ctg gec ggg ctg gcg ctg cag 
leu ala gly leu ala leu gin 

ate ctt teg ctg ctg ttg gcg 
ile leu ser leu leu leu ala 

ttc gtg gcg eta ccg gcg ate 
phe val ala leu pro ala ile 

egg cgc gtg gtg gcg ate gat 
arg arg val val ala ile asp 



SEQ XD NOS: 150-151 

FIG. 8D 



ORF predicted by Cole et al . {Nature 393:537-544) and containing Rv2563 
1/ 1 SEQ ID NO: 152 31/11 

tag gtt tea aga agg cct gtg cag gtt tec gca gec tgg gec gcg gcg cca ccg aag age 
AMB (val ser arg arg pro val gin val ser ala ala trp ala ala ala pro pro lys ser 
e\/2X^ SEQ ID NO: 153 91/31 

ccg ccg aaa tgg get aat egg gtt cgc ttg get cga teg ccg atg ate teg ace gee acg 
pro pro lys trp ala asn arg val arg leu ala arg ser pro met ile ser thr ala thr 
121/41 151/51 

ace gac ccc etc ace teg gtc gaa cct egg cga ace aac gcg gca acg cca gee cat gat 
thr asp pro leu thr ser val glu pro arg arg thr asn ala ala thr pro ala his asp 
181/61 211/71 

cat ttg att ggg tec acg gaa gca ggt age ttc cgt cgc atg ctt ttt gcg get ttg cgt 
his leu ile gly ser thr glu ala gly ser phe arg arg met leu phe ala ala leu arg 
241/81 271/91 

gat gtc caa tgg cga aaa cga cgc ctt gtc ate gca ate gtc age ace ggc eta gtt ttc 
asp val gin trp arg lys arg arg leu val ile ala ile val ser thr gly leu val phe 
301/101 331/111 

gcg atg acg etc gtt ctg acc gga ctt gtg aac ggg ttt egg gtc gag gee gag cga ace 
ala met thr leu val -leu thr gly leu val asn gly phe arg val glu ala glu arg thr 
361/121 391/131 

gtc gat tec atg ggt gtc gac gca ttc gtg gtc aag gec ggc gcg gca gga ccg ttc ctg 
val asp ser met gly val asp ala phe val val lys ala gly ala ala gly pro phe leu 
421/141 451/151 

ggt teg aca cca ttc gee caa ate gac ctg ccc cag gtt get cgt gcg cct ggc gtc ttg 
gly ser thr pro phe ala gin ile asp leu pro gin val ala arg ala pro gly val leu 
481/161 511/171 

get gec gee cca eta gcg act gcg ccg teg acg ate egg cag ggc acg tea gcg cga aac 
ala ala ala pro leu ala thr ala pro ser thr ile arg gin gly thr ser ala arg asn 
541/181 571/191 

gtc acc gcg ttc ggg gca cca gag cac gga ccc ggc atg ccg egg gtc teg gac ggt egg 
val thr ala phe gly ala pro glu his gly pro gly met pro arg val ser asp gly arg 
601/201 631/211 

gcg cca teg acg ccg gac gag gtc gcg gtg teg age acg ctg ggc cga aac etc ggc gac 
ala pro ser thr pro asp glu val ala val ser ser thr leu gly arg asn leu gly asp 
661/221 691/231 

gat ctg caa gtg ggt gcg cgc act ttg egg ate gtc ggc ate gtg ccc gag tea acc gcg 
asp leu gin val gly ala arg thr leu arg ile val gly ile val pro glu ser thr ala 
721/241 751/251 

ctg gca aag att ccc aac ate ttc ctg acc acc gaa ggc eta cag cag ttg gca tac aac 
leu ala lys ile pro asn ile phe leu thr thr glu gly leu gin gin leu ala tyr asn 
781/261 811/271 

gga cag ccg aca ate agt teg ate ggg ate gac ggg atg ccc cga cag etc ccg gac ggc 
gly gin pro thr ile ser ser ile gly ile asp gly met pro arg gin leu pro asp gly 
841/281 871/291 

tat cag acc gtc aat cga gcg gat get gtc age gat ctg atg cgc ccg ttg aag gtc gcg 
tyr gin thr val asn arg ala asp ala val ser asp leu met arg pro leu lys val ala 
901/301 931/311 

gtg gat gcg ate acg gtt gtg gcg gtc ttg ctg tgg ate gtt gcg gcg ttg ate gtc ggc 
val asp ala ile thr val val ala val leu leu trp ile val ala ala leu ile val gly 
961/321 991/331 

teg gtg gtc tac etc tct gcg ttg gag egg ctg cgt gac ttt gcg gtg ttc aag gcg ate 
ser val val tyr leu ser ala leu glu arg leu arg asp phe ala val phe lys ala ile 
1021/341 1051/351 

ggc gtg ccg acg cgc teg att ctg gee ggg ctg gcg ctg cag gcg gtc gtc gtc gcg ctg 
gly val pro thr arg ser ile leu ala gly leu ala leu gin ala val val val ala leu 
1081/361 1111/371 

etc gcg gcg gtg gtt ggc ggc ate ctt teg ctg ctg ttg gcg ccg ttg ttc ccg atg act 
leu ala ala val val gly gly ile leu ser leu leu leu ala pro leu phe pro met thr 
1141/381 1171/391 

gtc gtg gta ccc ctg agt gee ttc gtg gcg eta ccg gcg ate gcg act gtg ate ggt ctg 
val val val pro leu ser ala phe val ala leu pro ala ile ala thr val ile gly leu 
1201/401 1231/411 

ctg gec age gtc gca gga ctg egg cgc gtg gtg gcg ate gat ccg gca eta gcg ttc gga 
leu ala ser val ala gly leu arg arg val val ala ile asp pro ala leu ala phe gly 
1261/421 
ggt ccc tag 
gly pro) AMB 

SEQ XD NOS: 152-153 



FIG. 8E 



sequence of Rv0072 predicted by Cole et al. (Nature 393:537-544) and 
exhibiting more than 77% similarity with Seq8D' 
1/ 1 ^ SEQ ID NO: 154 31/11 

atg etc ttc gcg gec ctg cgt gac atg caa tgg aga aag cgc cgc ctg gtc ate acg ate 
( Met le u phe ala ala leu arg asp met gin trp arg lys arg arg leu val ile thr ile 
61/21 ' SEQ ID NO: 155 91/31 

ate age ace ggg ctg ate ttc ggg atg acg ctt gtt ttg ace gga etc gcg aac ggc ttc 
ile ser thr gly leu ile phe gly met thr leu val leu thr gly leu ala asn gly phe 
121/41 151/51 

egg gtg gag gec egg cac acc gtc gat tec atg ggt gtc gat gta ttc gtc gtc aga tec 
arg val glu ala arg his thr val asp ser met gly val asp val phe val val arg ser 
181/61 211/71 

ggc get get gga cct ttt ctg ggt tea ata ccg ttt ccc gat gtt gac ctg gee cga gtg 
gly ala ala gly pro phe leu gly ser ile pro phe pro asp val asp leu ala arg val 
241/81 271/91 

gec get gaa ccc ggt gtc atg gee gcg gec ccg ttg ggc age gtg ggg acg ate atg aaa 
ala ala glu pro gly val met ala ala ala pro leu gly ser val gly thr ile met lys 
301/101 331/111 

gaa ggc acg teg acg cga aac gtc acg gtc ttc ggc gcg ccc gag cac gga cct ggc atg 
glu gly thr ser thr arg asn val thr val phe gly ala pro glu his gly pro gly met 
361/121 391/131 

cca egg gtc tea gag ggt egg tea ccg teg aaa ccg gac gaa gtc gcg gca teg age acg 
pro arg val ser glu gly arg ser pro ser lys pro asp glu val ala ala ser ser thr 
421/141 451/151 

atg ggc cga cac etc ggt gac act gtc gag gtc ggc gcg cgc aga ttg egg gtc gtt ggc 
met gly arg his leu gly asp thr val glu val gly ala arg arg leu arg val val gly 
481/161 511/171 

att gtg ccg aat tec acc gcg ctg gee aag ate ccc aat gtc ttc etc acg acc gag ggc 
ile val pro asn ser thr ala leu ala lys ile pro asn val phe leu thr thr glu gly 
541/181 571/191 

tta cag aaa ttg gcg tac aac ggg cag ccg aat ate acg tec ate ggg ate ata ggt atg 
leu gin lys leu ala tyr asn gly gin pro asn ile thr ser ile gly ile ile gly met 
601/201 631/211 

ccc cga cag ctg ccg gag ggt tac cag act ttc gat egg gtg ggc get gtc aat gat ttg 
pro arg gin leu pro glu gly tyr gin thr phe asp arg val gly ala val asn asp leu 
661/221 691/231 

gtg cgc cca ttg aag gtc gca gtg aat teg ate teg ate gtg get gtt ttg ctg tgg att 
val arg pro leu lys val ala val asn ser ile ser ile val ala val leu leu trp ile 
721/241 751/251 

gtg gcg gtg ctg ate gtc ggc teg gtg gtg tac ctt teg get ctt gag egg eta cgt gac 
val ala val leu ile val gly ser val val tyr leu ser ala leu glu arg leu arg asp 
781/261 811/271 

ttc gcg gtg ttc aag gcg att ggc acg cca acg cgc teg att atg gec ggg etc gca tta 
phe ala val phe lys ala ile gly thr pro thr arg ser ile met ala gly leu ala leu 
841/281 871/291 

cag gcg ctg gtc att gcg ttg ctt gcg gcg gtg gtg ggc gtc gtc ctg gcg cag gtg ttg 
gin ala leu val ile ala leu leu ala ala val val gly val val leu ala gin val leu 
901/301 931/311 

gca cca ctg ttt ccg atg att gtc gcg gta ccc gtc ggt get tac ctg gcg eta ccg gtg 
ala pro leu phe pro met ile val ala val pro val gly ala tyr leu ala leu pro val 
961/321 991/331 

gee gcg ate gtc ate ggt ctg ttc get agt gtt gec gga ttg aag cgc gtg gtg acg gtc 
ala ala ile val ile gly leu phe ala ser val ala gly leu lys arg val val thr val 
1021/341 

gat ccc gcg cag gcg ttc gga ggt ccc tag 

asp pro ala gin ala phe gly gly pro) AMB 

V 

SEQ ID NOS: 154-155 ( MAR 2 6 2MB K] 

FIG. 8F V*--*^ 



Seq8H : ORF predicted by Cole et al . (Nature 393:537-544) and containing se<g8G 



1/ 1 : ^- SEQ ID NO: 156 
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gcg aac 
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egg 
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aga 
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get 
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acg ate 
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aaa 


gaa 
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teg 
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~ *_ *_ 
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etc 
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He 
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ala leu 


aia 
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ne 
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phe 
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-J *± X / X O X 


















d / i / iy i 


















acg acc 


y ay 


y y <~ 


tta 


cag 


aaa 


c ty 


gcg 


cac 


aac yyy 


cag 


ccg 


a a f- 

aac 


4- 

acc 


acg 


tec 
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ggg 


UI1X Ulli 


g lu 


y xy 
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gin 


lys 
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asn gly 
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asn 
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enr 
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lie 


giy 


fin i / 9 m 
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1 /Oil 
D J X / Z X X 


















ate a t a 


ggt 


d l. y 


ccc 


cga 


cag 


ccg 


ccg 


gag 


ggt tac 


cag 




t- i- r* 


ga t 


egg 


gtg 
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gc t 


x le x 1 e 


y j-y 


me t 
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O O X / ^ ^ X 


















o y x / ^ ji 


















gtc aat 


gat 


u y 


gtg 


cgc 
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t- ty 


aag 


gtc 


yea gtg 


t- 
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teg 


ace 
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get 
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1 eu 
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val 


ala 
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/ D X / £ 3 X 


















ttg ctg 


tgg 


=}+-+- 

o. L. L- 


gtg 


gcg 


gtg 
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gtc 


ggc ccg 


gtg 
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r* t- t- 


teg 
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gag 
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J- x 
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val 
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g lu 
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oll/^ /X 


















egg eta 


cgt 


gac 


ccc 


y *-y 


y t-y 


4- 4- 

cue 


aag 


gcg 


att ggc 


acg 


cca 


acg 


cgc 


teg 


acc 


acg 


gee 


arg leu 


arg 


asp 


phe 


ala 


val 


phe 


lys 


aia 


ile gly 


4- V> 

enr 


pro 


4- Vi -v- 

tnr 


arg 


ser 


lie 


met 


aia 


841/281 


















871/291 


















ggg etc 


gca 


tta 


cag 


gcg 


ctg 


gtc 


acc 


gcg 


ttg ctt 


gcg 


gcg 


gtg 


gtg 


ggc 


gtc 


gtc 


ctg 


gly leu 


ala 


leu 


gin 


ala 


leu 


vd _L 




axa 


leu leu 


ala 


ala 


Vd -L 


V d -L 


giy 


vdi 


vai 


_L tr LJ. 


901/301 


















931/311 


















gcg cag 


gtg 


ttg 


gca 


cca 


ctg 


ttt 


ccg 


atg 


att gtc 


gcg 


gta 


ccc 


gtc 


ggt 


get 


tac 


ctg 


ala gin 


val 
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ala 


pro 


leu 


phe 


pro 


met 


ile val 


ala 


val 


pro 


val 


giy 


ala 


tyr 


leu 


961/321 


















991/331 


















gcg eta 


ccg 


gtg 


gec 


gcg 


ate 
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ate 


ggt 


ctg ttc 


get 


agt 
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ile 
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1021/341 
















1051/351 
















gtg gtg 
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cag 
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SEQ ZD NOS: 156-157 



FIG. 8G 



MAR 2 6 2003 S 




SEQ XD MO: 158 



CGA GGC CGA GCG AAC CGT CGA TTC CAT GGG 
( arg gl y arg ala asn arg arg phe his gly 
61/21 " ■ — SEQ ID NO: 159 







Vjr J. J. 


ppm 


\J\J\J 


rnmn 






A mm 
All 


al v a ttt 


thr 


val 


pro 


gly 


phe 


a sp 


L.I1X. 


lift 
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121/41 


















TGC GCC 


TGG 


CGT 


CTT 


GGC 


TGC 


CGC 


CCC 


ACT 


cys ala 


trp 


arg 


leu 


gly 
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arg 


pro 


thr 


181/61 


















CAC GTC 


AGC 


GCG 


AAA 


CGT 


CAC 


CGC 


GTT 


CGG 


his val 


ser 


ala 


lys 


arg 


his 


arg 


val 


arg 


241/81 


















GGT CTC 


GGA 


CGG 


TCG 


GGC 


GCC 


ATC 


GAC 


GCC 


gly leu 


gly 


arg 


ser 


gly 


ala 


ile 


asp 


ala 


301/101 


















CCG AAA 


CCT 


CGG 


CGA 


CGA 


TC 








pro lys 


pro 


arg 


arg 


arg) 











31/11 

TGT CGA CGC ATT CGT GGT CAA GGC CGG CGC 
cys arg arg ile arg qly gin gly arg arg 
91/31 

CGC CCA AAT CGA CCT GCC CCA GGT TGC TCG 

arg pro asn arg pro ala pro gly cys ser 
151/51 

AGC GAC TGC GCC GTC GAC GAT CCG GCA GGG 

ser asp cys ala val asp asp pro ala gly 
211/71 

GGC ACC AGA GCA CGG ACC CGG CAT GCC GCG 

gly thr arg ala arg thr arg his ala ala 
271/91 

GGA CGA GGT CGC GGT GTC GAG CAC GCT GGG 

gly arg gly arg gly val glu his ala gly 



SEQ ID NOS: 158-159 

FIG. 9A 



j — SEQ ID NO: 160 

GAG GCC GAG CGA ACC GTC GAT TCC ATG 
( crlu a la glu arg thr val asp ser met 

62/21" — ■ SEQ ID NO: 161 

GCA GGA CCG TTC CTG GGT TCG ACA CCA 
ala gly pro phe leu gly ser thr pro 
122/41 

GCG CCT GGC GTC TTG GCT GCC GCC CCA 
ala pro gly val leu ala ala ala pro 
182/61 

ACG TCA GCG CGA AAC GTC ACC GCG TTC 
thr ser ala arg asn val thr ala phe 
242/81 

GTC TCG GAC GGT CGG GCG CCA TCG ACG 
val ser asp gly arg ala pro ser thr 
302/101 

CGA AAC CTC GGC GAC GAT C 
arg asn leu gly asp asp) 



32/11 

GGT GTC GAC GCA TTC GTG GTC AAG GCC GGC GCG 
gly val asp ala phe val val lys ala gly ala 
92/31 

TTC GCC CAA ATC GAC CTG CCC GAG GTT GCT CGT 
phe ala gin ile asp leu pro gin val ala arg 
152/51 

CTA GCG ACT GCG CCG TCG ACG ATC CGG CAG GGC 
leu ala thr ala pro ser thr ile arg gin gly 
212/71 

GGG GCA CCA GAG CAC GGA CCC GGC ATG CCG CGG 
gly ala pro glu his gly pro gly met pro arg 
272/91 

CCG GAC GAG GTC GCG GTG TCG AGC ACG CTG GGC 
pro asp glu val ala val ser ser thr leu gly 



SEQ ID NOS: 160-161 



FIG. 9B 
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SEQ ID NO: 


164- — - 
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SEQ ID NOS : 162- 
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p — SEQ ID NO: 165 












31/11 
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61/21^- 


SEQ ID NO: 166 








91/31 


















GCT CTT 
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CAG CCG 


GGT 
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gly 


1 eu 


ala 
aid 


121/41 


















151/51 


















ATC AAT 


CGC 


CGG 


CTC 


GCG 


GAC 


GTA 


GAT 


AAT 


CAG CTC 


ACC 


GTT 


GGG 


ACC 


GAC 


CTC 


GAC 


CAG 


ile asn 


arg 


arg 


leu 


ala 


asp 


val 


asp 


asn 


gin leu 


thr 


val 


gly 


thr 


asp 


leu 


asp 


gin 


181/61 


















211/71 


















GGG TCC 


TTT 


GTG 


ACT 


GCC 


GGG 


CTT 


GAC 


GCG 


GAC GAC 


CAC 


AGA 


GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


gly ser 


phe 


val 


thr 


ala 


gly 


leu 


asp 


ala 


asp asp 


his 


arg 


val 


gly 


his 


arg 


leu 


c* -U y 


241/81 


















271/91 


















CTA CCG 


TTC 


TGA 


CCT 


GGG 


GCT 


GCG 


TGG 


GCG 


CCG ACG 


ACG 


TGA 


GGC 


ACG 


TCA 


TGT 


CTC 


AGC 


leu pro 


phe)OPA(pro 


gly 


ala 


ala 


trp 


ala 


pro thr 


thr)OPA<gly 


thr 


ser 


cys 


leu 


ser 


301/101 




c SEQ ID 


NO: 


167 




331/111 






SEQ ID NO; 


168 




GGC CCA 


CCG 


CCA 


CCT 


CGG 


TCG 


CCG 


GCA 


GTA 


TGT CAG 


CAT 


GTG 


CAG 


ATG 


ACT 


CCA 


CGC 


AGC 


gly pro 


pro 


pro 


pro 


arg 


ser 


pro 


ala 


val 


cys gin 


his 


val 


gin 


met 


thr 


pro 


arg 


ser 


361/121 


















391/131 


















CTT GTT 


CGC 


ATC 


GTT 


GGT 


GTC 


GTG 


GTT 


GCG 


ACG ACC 


TTG 


GCG 


CTG 


GTG 


AGC 


GCA 


CCC 


GCC 


leu val 


arg 


ile 


val 


gly 


val 


val 


val 


ala 


thr thr 


leu 


ala 


leu 


val 


ser 


ala 


pro 


ala 


421/141 




































GGC GGT 


CGT 


GCC 


GCG 


CAT 


GCG 


GAT 


C 






















gly gly 


arg 


ala 


ala 


his 


ala 


asp) 

























SEQ ID NOS: 165-168 



FIG. 10A 



£>lL\j 


ID NO: 


169 






























X An 


CTC 


AGA 


CGG 


AAA 


CGC 


TTG 


AAC 




CjACj GTC 


GCT 


CCG 


GAC 


ACC 


AAT 


TTG 


ACT 


CGG 


OCH(arg 


leu 


arg 


arg 


lys 


arg 


leu 


asn 


arg 


glu val 


ala 


pro 


asp 


thr 


asn 


leu 


thr 


arg 


62/21^— 


SEQ 


XD NO: 


17 0 




























CTC TTT 


GGC 


AAT 


TGA 


AGG 


TGA 


GCT 


GCG 


A(j(- 


ACjC CGG 


GTG 


ACC 


GCA 


TCG 


TTG 


GCC 


TTG 


CCA 


leu phe 


gly 


asn)OPA 


arg 


OPA(ala 


ala 


ser 


ser arg 


Val 


thr 


ala 


ser 


leu 


ala 


leu 


pro 


122/41 


SEQ XD 


NO: 


171 ' 


























TCA ATC 


GCC 


GGC 


TCG 


CGG 


ACG 


TAG 


ATA 


ATC 


AGC TCA 


CCG 


TTG 


GGA 


CCG 


ACC 


TCG 


ACC 


AGG 


ser ile 


ala 


gly 


ser 


arg 


thr)AMB(ile 


ile 


ser ser 


pro 


leu 


gly 


pro 


thr 


ser 


thr 


arg 


182/61 




SEQ XD NO: 


172 ~" 






2 12/71 


















GGT Cp&g«EG TGA CTG 
gly 'pro n.e'uTt>PA (leu 


CCG 
pro 


GGC 

gly 


TTG 
leu 


ACG 
thr 


CGG 
arg 


ACG ACC 
thr thr 


ACA 
thr 


GAG 
glu 


TCG 
ser 


GTC 
val 


ATC 
ile 


GCC 
ala 


TAA GGC 
)OCH(gly 


242/81 






^SEQ 


XD NO: : 


173 






272/91 




SEQ XD 


NO: 


174 


TAC CGT 


TCT 


GAC 


CTG 


GGG 


CTG 


CGT 


GGG 


CGC 


CGA CGA 


CGT 


GAG 


GCA 


CGT 


CAT 


GTC 


TCA 


GCG 


tyr arg 


ser 


asp 


leu 


gly 


leu 


arg 


gly 


arg 


arg arg 


arg 


glu 


ala 


arg 


his 


val 


ser 


ala 


302/101 


















332/111 


















GCC CAC 


CGC 


GAC 


CTC 


GGT 


CGC 


CGG 


CAG 


TAT 


GTC AGC 


ATG 


TGC 


AGA 


TGA 


CTC 


CAC 


GCA 


GCC 


ala his 


arg 


his 


leu 


gly 


arg 


arg 


gin 


tyr 


val ser 


met 


cys 


arg)OPA(leu 


his 


ala 


ala 


362/121 


















392/131 


SEQ 


XD NO: 


175 










TTG TTC 


GCA 


TCG 


TTG 


GTG 


TCG 


TGG 


TTG 


CGA 


CGA CCT 


TGG 


CGC 


TGG 


TGA 


GCG 


CAC 


CCG 


CCG 


leu phe 


ala 


ser 


leu 


val 


ser 


trp 


leu 


arg 


arg pro 


trp 


arg 


trp)OPA(ala 


his 


pro 


pro 


422/141 


















SEQ XD NO 


i 176 — 








GCG GTC 


GTG 


CCG 


CGC 


ATG 


CGG 


ATC 
























ala val 


val 


pro 


arg 


met 


arg 


Ile) 

























SEQ ID NOS: 170-17 6 

FIG. 10B 



^SEQ ID NO: 177 SEQ ID NO: 179 33/11 

AAC GAC TCA GAC GGA AAC GCT TGA^ACC GCG AGG TCG CTC CGG ACA CCA ATT TGA CTC GGC 
( asn asp ser asp gly asn ala)OPA(thr ala arg ser leu arg thr pro ile)OPA(leu gly 
63/21 ^SEQ ID NO: 178 93/31 SEQ ID NO: 180 ~ 



TCT TTG 


GCA 


ATT 


GAA 


GGT 


GAG 


CTG 


CGA 


GCA 


GCC GGG 


TGA 


CCG 


CAT 


CGT 


TGG 


CCT 


TGC 


CAT 


ser leu 


ala 


ile 


glu 


gly 


glu 


leu 


arg 


ala 


ala gly)OPA< 


pro 


his 


arg 


trp 


pro 


cys 


his 


123/41 


















153/51 


SEQ ID NO: 


181 








CAA TCG 


CCG 


GCT 


CGC 


GGA 


CGT 


AGA 


TAA 


TCA 


GCT CAC 


CGT 


TGG 


GAC 


CGA 


CCT 


CGA 


CCA 


GGG 


gin ser 


pro 


ala 


arg 


gly 


arg 


arg) OCH( ser 


ala his 


arg 


trp 


asp 


arg 


pro 


arg 


pro 


gly 


183/61 






SEQ 


ID NO: 


182 






213/71 


















GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT 


TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


val leu 


cys 


asp 


cys 


arg 


ala)OPA(arg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


243/81 






SEQ 


ID NO: 


183- 






273/91 


















ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC 


GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


CGG 


thr val 


leu 


thr 


trp 


gly 


cys 


val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


303/101 


















333/111 


















CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC 


GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


CAG 


CCT 


pro thr 


ala 


thr 


ser 


val 


ala 


gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin 


pro 


363/121 


















393/131 


















TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT 


GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


CGG 


cys ser 


his 


arg 


trp 


cys 


arg 


gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu 


arg 


thr 


arg 


arg 



423/141 

CGG TCG TGC CGC GCA TGC GGA TC 
arg ser cys arg ala cys gly) 

SEQ ID NOS: 177-183 



FIG. 10C 



I 6 7BO 



^ SEQ XD NO: 184 31/11 

CCC GAA GAG GTC CCC CGT TTT GTT AAT TTT TAA AAA ATT TGT GTC AC A AAC CGG GGT ACC 
( pjro g lu glu val pro arg phe val asp phe)OCH(lys ile cys val thr lys arg gly thr 

61 / 2 r~~" SEQ ID NO: 185 91/31*" SEQ ID NO: 186 

AAG GCA TAA AAC CTA GTA CCT GGG GCG GCG GAT TCA ACG AAA ACC GAG TGG GGG TAG TCA 

lys ala)OCH(asn leu val pro gly ala ala asp ser thr lys thr glu trp gly)AMB(ser 

121/41 ^SEQ ID NO: 187 151/51 SEQ ID NO: 188-^ 

GGG GCG TGC ATT CCG ACG ACC CTG TAC GAC CCG CTG GTG GCA ACG CCG ATG AGT GCG CCG 

gly ala cys ile pro thr thr leu tyr asp pro leu val ala thr pro met ser ala pro 

181/61 211/71 

ACG AAG GCC GAG CGA CGG GCT GCC GGC GCT GAC CGC CGC GGA AGC CGC CGA GTG CAT GGT 

thr lys ala glu arg arg ala ala gly ala asp arg arg gly ser arg arg val asp gly 

241/81 271/91 

CAC CAC CGC CCG CAC CCG ACC GGT ACG GAT CGC GCC TCG GGT TAC CGT CGC CGT CAA CGC 

his his arg pro his pro thr gly thr asp arg ala ser gly tyr arg arg arg gin arg 

301/101 331/111 

GCT GGA CAG CAT CGG TCC CCG CTG GGT CAA TGC ACT CAT GCA GCG CCG CAA CGA ACA GCT 

ala gly gin his arg ser pro leu gly gin cys thr his ala ala pro gin arg thr ala 

361/121 391/131 

CAA CCC TTG AAC CGG GTC CCG GCC TGC CGA CCC TCG GCC GCC GGC GTG CCG CTA CGT GAT 

gin pro leu asn arg val pro ala cys arg pro ser ala ala gly val pro leu arg asp 

421/141 451/151 

AGA CAC AGG GCC ATG GAA ATC CTG GCC AGC CGG ATG CTA CTT CGG CCG GCG GAC TAT CAG 

arg his arg ala met glu ile leu ala ser arg met leu leu arg pro ala asp tyr gin 
481/161 

CGG TCG CTG AGC TTC TAC CGT GAC CAG ATC 

arg ser leu ser phe tyr arg asp gin ile) 

SEQ ID NOS: 184-188 



FIG. 11 A 



^^SEQ ID NO: 189 












32/11 


















CCG AAG 


AGG 


TCC 


CCC 


GTT 


TTG 


TTA 


ATT 


TTT 


AAA AAA 


TTT 


GTG 


TCA 


CAA 


AGC 


GGG 


GTA 


CCA 


(gro lys 


arg 


ser 


pro 


val 


leu 


leu 


ile 


phe 


lys lys 


phe 


val 


ser 


gin 


ser 


gly 


val 


pro 


62/21^^ 


SEQ 


ID NO: 


190 










92/31 


















AGG CAT 


AAA 


ACC 


TAG 


TAC 


CTG 


GGG 


CGG 


CGG 


ATT CAA 


CGA 


AAA 


CCG 


AGT 


GGG 


GGT 


AGT 


CAG 


arg his 


lys 


thr)AMB( tyr 


leu 


gly 


arg 


arg 


ile gin 


arg 


lys 


pro 


ser 


gly 


gly 


ser 


gin 


122/41 




SEQ ID NO: 191 






152/51 


















GGG CGT 


GCA 


TTC 


CGA 


CGA 


CCC 


TGT 


ACG 


ACC 


CGC TGG 


TGG 


CAA 


CGC 


CGA 


TGA 


GTG 


CGC 


CGA 


gly arg 


ala 


phe 


arg 


arg 


pro 


cys 


thr 


thr 


arg trp 


trp 


gin 


arg 


arg) OPA(val 


arg 


arg 


182/61 


















212/71 


SEQ ID NO: 192-^ 


w 






CGA AGG 


ccG 


AGC 


GAC 


GGG 


CTG 


CCG 


GCG 


CTG 


ACC GCC 


GCG 


GAA 


GCC 


GCC 


GAG 


TGG 


ATG 


GTC 


arg arg 


pro 


ser 


asp 


gly 


leu 


pro 


ala 


leu 


thr ala 


ala 


glu 


ala 


ala 


glu 


trp 


met 


val 


242/81 


















272/91 


















ACC ACC 


GCC 


CGC 


ACC 


CGA 


CCG 


GTA 


CGG 


ATC 


GCG CCT 


CGG 


GTT 


ACC 


GTC 


GCC 


GTC 


ACC 


GCG 


thr thr 


ala 


arg 


thr 


arg 


pro 


val 


arg 


ile 


ala pro 


arg 


val 


thr 


val 


ala 


val 


asn 


ala 


302/101 


















332/111 


















CTG GAC 


AGC 


ATC 


GGT 


CCC 


CGC 


TGG 


GTC 


AAT 


GCA CTC 


ATG 


CAG 


CGC 


CGC 


AAC 


GAA 


CAG 


CTC 


leu asp 


ser 


ile 


gly 


pro 


arg 


trp 


val 


asn 


ala leu 


met 


gin 


arg 


arg 


asn 


glu 


gin 


leu 


362/121 


















392/131 


















AAC CCT 


TGA 


ACC 


GGG 


TCC 


CGG 


CCT 


GCC 


GAC 


CCT CGG 


CCG 


CCG 


GCG 


TGC 


CGC 


TAC 


GTG 


ATA 


asn pro)OPA(thr 


gly 


ser 


arg 


pro 


ala 


asp 


pro arg 


pro 


pro 


ala 


cys 


arg 


tyr 


val 


ile 


422/141 




^ — SEQ ID NO: 


193 






452/151 


















GAC ACA 


GGG 


CCA 


TGG 


AAA 


TCC 


TGG 


CCA 


GCC 


GGA TGC 


TAC 


TTC 


GCC 


CGG 


CGG 


ACT 


ATC 


AGC 


asp thr 


gly 


pro 


trp 


lys 


ser 


trp 


pro 


ala 


gly cys 


tyr 


phe 


gly 


arg 


arg 


thr 


ile 


ser 



482/161 

GGT CGC TGA GCT TCT ACC GTG ACC AGA TC 
gly ar) OPA(ala ser thr val thr arg) 
V SE q id no: 194 

SEQ ID NOS: 189-194 



FIG. 11 B 



^ SEQ ID NO: 195 33/11 

CGA AGA GGT CCC CCG TTT TGT TAA TTT TTA AAA AAT TTG TGT CAC AAA GCG GGG TAC CAA 
(ar g a rg gly pro pro phe cys)OCH(phe leu l ys asn leu cys his lys ala gly tyr gin 
6372T^-SEQ ID NO: 196 -"**9 3 / 3 1^- SEQ ID NO: 197 

GGC ATA AAA CCT AGT ACC TGG GGC GGC GGA TTC AAC GAA AAC CGA GTG GGG GTA GTC AGG 
gly ile lys pro ser thr trp gly gly gly phe asn glu asn arg val gly val val arg 
123/41 153/51 

GGC GTG CAT TCC GAC GAC CCT GTA CGA CCC GCT GGT GGC AAC GCC GAT GAG TGC GCC GAC 
gly val his ser asp asp pro val arg pro ala gly gly asn ala asp glu cys ala asp 
183/61 213/71 

GAA GGC CGA GCG ACG GGC TGC CGG CGC TGA CCG CCG CGG AAG CCG CCG AGT GGA TGG TCA 
glu gly arg ala thr gly cys arg arg)OPA(pro pro arg lys pro pro ser gly trp ser 
243/81 SEQ ID NO: 198 ^273/91 

CCA CCG CCC GCA CCC GAC CGG TAC GGA TCG CGC CTC GGG TTA CCG TCG CCG TCA ACG CGC 
pro pro pro ala pro asp arg tyr gly ser arg leu gly leu pro ser pro ser thr arg 
303/101 333/111 

TGG ACA GCA TCG GTC CCC GCT GGG TCA ATG CAC TCA TGC AGC GCC GCA ACG AAC AGC TCA 
trp thr ala ser val pro ala gly ser met his ser cys ser ala ala thr asn ser ser 
363/121 393/131 

ACC CTT GAA CCG GGT CCC GGC CTG CCG ACC CTC GGC CGC CGG CGT GCC GCT ACG TGA TAG 
thr leu glu pro gly pro gly leu pro thr leu gly arg arg arg ala ala thr)OPA AMB 
423/141 453/151 

ACA CAG GGC CAT GGA AAT CCT GGC CAG CCG GAT GCT ACT TCG GCC GGC GGA CTA TCA GCG 
( thr gin gly his gly asn pro gly gin pro asp ala thr ser ala gly gly leu ser ala 
4*83/ 161"-""- — - SEQ ID NO: 199 
GTC GCT GAG CTT CTA CCG TGA CCA GAT C 
val ala glu leu leu pro)OPA pro asp 



SEQ ID NOS: 195-199 

FIG. 11C 



part of the nucleotide sequence of Seqll 



1/1 ^ SEQ ID NO: 


200 










31/11 


















CGT CGC 


CGT 


CAA 


CGC 


GCT 


GGA 


CAG 


CAT 


CGG 


TCC CCG 


CTG 


GGT 


CAA 


TGC 


ACT 


CAT 


GCA 


GCG 


(arg arg 


arg 


gin 


arg 


ala 


gly 


gin 


his 


arg 


ser pro 


leu 


gly 


gin 


cys 


thr 


his 


ala 


ala 


61/21 - 


SEQ 


ID NO: 


201 










91/31 


















CCG CAA 


CGA 


ACA 


GCT 


CAA 


CCC 


TTG 


AAC 


CGG 


GTC CCG 


GCC 


TGC 


CGA 


CCC 


TCG 


GCC 


GCC 


GGC 


pro gin 


arg 


thr 


ala 


gin 


pro 


leu 


asn 


arg 


val pro 


ala 


cys 


arg 


pro 


ser 


ala 


ala 


gly 


121/41 


















151/51 


















GTG CCG 


CTA 


CGT 


GAT 


AGA 


CAC 


AGG 


GCC 


ATG 


GAA ATC 


CTG 


GCC 


AGC 


CGG 


ATG 


CTA 


CTT 


CGG 


val pro 


leu 


arg 


asp 


arg 


his 


arg 


ala 


met 


glu ile 


leu 


ala 


ser 


arg 


met 


leu 


leu 


arg 


181/61 


















211/71 


















CCG GCG 


GAC 


TAT 


CAG 


CGG 


TCG 


CTG 


AGC 


TTC 


TAC CGT 


GAC 


CAG 


ATC 












pro .ala 


asp, 


tyr 


gin 


arg 


ser 


leu 


ser 


phe 


tyr arg 


asp 


gin 


ile) 













SEQ ID NOS:200-201 



FIG. 11A B 



1/1 ^-SEQ ID NO: 202 31/11 



GTC GCC 


GTC 


AAC 


GCG 


CTG 


flap XlCZC ATP 
uliv five r\ J. e 


GGT 


ccc cac 






r\*\ J. 


nc a 


^. 1 L 


ATT* 


par* 




(val ala 


val 


asn 


ala 


leu 


asp ser ile 


gly 


pro arg 


trp 


val 


asn 


ala 


leu 


met 


gin 


arg 


61/21^- 


SEQ 


ID NO: 


203 






91/31 ; 


















CGC AAC 


GAA 


CAG 


CTC 


AAC 


CCT TGA ACC 


GGG 


TCC CGG 


CCT 


GCC 


GAC 


CCT 


CGG 


CCG 


CCG 


GCG 


arg asn 


glu 


gin 


leu 


asn 


pro)OPA(thr 


gly 


ser arg 


pro 


ala 


asp 


pro 


arg 


pro 


pro 


ala 


121/41 




SEQ 


ID NO: 


204 




151/51 


















TGC CGC 


TAC 


GTG 


ATA 


GAC 


ACA GGG CCA 


TGG 


AAA TCC 


TGG 


CCA 


GCC 


GGA 


TGC 


TAC 


TTC 


GGC 


cys arg 


tyr 


val 


ile 


asp 


thr gly pro 


trp 


lys ser 


trp 


pro 


ala 


gly 


cys 


tyr 


phe 


gly 


181/61 














211/71 


















CGG CGG 


ACT 


ATC 


AGC 


GGT 


CGC TGA GCT 


TCT 


ACC GTG 


ACC 


AGA 


TC 












arg arg 


thr 


ile 


ser 


gly 


arg)OPA(ala 


ser 


thr val 


thr 


arg) 















SEQ ID NO: 205 — 

SEQ ID NOS:202-205 



FIG. 11B' 



1/1 SEQ ID NO: 206 










31/11 


















TCG CCG 


TCA ACG 


CGC 


TGG 


ACA 


GCA 


TCG 


GTC 


CCC GCT 


GGG 


TCA 


ATG 


CAC 


TCA 


TGC 


AGC 


GCC 


(ser pro 


ser thr 


arg 


trp 


thr 


ala 


ser 


val 


pro ala 


gly 


ser 


met 


his 


ser 


cys 


ser 


ala 


61/21^- 


— SEQ ID 


NO: 


207 












91/31 










GCA ACG 


AAC AGC 


TCA 


ACC 


CTT 


GAA 


CCG 


GGT 


CCC GGC 


CTG 


CCG 


ACC 


CTC 


GGC 


CGC 


CGG 


CGT 


ala thr 


asn ser 


ser 


thr 


leu 


glu 


pro 


gly 


pro gly 


leu 


pro 


thr 


leu 


gly 


arg 


arg 


arg 


121/41 
















151/51 


















GCC GCT 


ACG TGA 


TAG 


ACA 


CAG 


GGC 


CAT 


GGA 


AAT CCT 


GGC 


CAG 


CCG 


GAT 


GCT 


ACT 


TCG 


GCC 


ala ala 


thr)OPA 


AMB(thr 


gin 


gly 


his 


gly 


asn pro 


gly 


gin 


pro 


asp 


ala 


thr 


ser 


ala 


181/61 






* SEQ ID 


NO: 


208 


211/71 


















GGC GGA 


CTA TCA 


GCG 


GTC 


GCT 


GAG 


CTT 


CTA 


CCG TGA 


CCA 


GAT 


C 












gly gly 


leu ser 


ala 


val 


ala 


glu 


leu 


leu 


pro)OPA 


pro 


asp 















SEQ XD NOS:206-208 

FIG. 11C" 



sequence Rv0546c predicted by Cole et al . (Nature 393:537-544) and containing 
SeqllA* 



1/1^- SEQ ID NO: 


209 










31/11 


















a*tg gaa 


ate 


ctg 


gec 


age 


egg 


atg 


eta 


ctt 


egg ccg 


gcg 


gac 


tat 


cag 


egg 


teg 


ctg 


age 


(Met glu 


ile 


leu 


ala 


ser 


arg 


met 


leu 


leu 


arg pro 


ala 


asp 


tyr 


gin 


arg 


ser 


leu 


ser 


61/21^-SEQ ID NO: 210 










91/31 


















ttc tac 


cgt 


gac 


cag 


ate 


ggg 


ctg 


gcg 


att 


gee cgt 


gaa 


tac 


ggg 


gee 


ggc 


aca 


gtg 


ttt 


phe tyr 


arg 


asp 


gin 


ile 


gly 


leu 


ala 


ile 


ala arg 


glu 


tyr 


gly 


ala 


gly 


thr 


val 


phe 


121/41 


















151/51 


















ttc gec 


ggt 


cag 


tea 


ctg 


etc 


gaa 


ctg 


gee 


ggt tac 


ggc 


gag 


ccg 


gac 


cat 


teg 


egg 


gga 


phe ala 


gly 


gin 


ser 


leu 


leu 


glu 


leu 


ala 


gly tyr 


gly 


glu 


pro 


asp 


his 


ser 


arg 


gly 


181/61 


















211/71 


















cct ttt 


CCC 


ggc 


gcg 


ctg 


tgg 


ctg 


cag 


gtg 


cgc gac 


etc 


gag 


get 


acc 


cag 


acc 


gag 


ctg 


pro phe 


pro 


gly 


ala 


leu 


trp 


leu 


gin 


val 


arg asp 


leu 


glu 


ala 


thr 


gin 


thr 


glu 


leu 


241/81 


















271/91 


















gtc age 


cga 


ggc 


gtg 


teg 


ate 


get 


cgc 


gag 


ccc cgc 


cgc 


gaa 


ccg 


tgg 


ggc 


ctg 


cac 


gag 


val ser 


arg 


gly 


val 


ser 


ile 


ala 


arg 


glu 


pro arg 


arg 


glu 


pro 


trp 


gly 


leu 


his 


glu 


301/101 


















331/111 


















atg cat 


gtg 


acc 


gac 


cca 


gac 


ggg 


ate 


aca 


ctg ata 


ttc 


gtc 


gag 


gtt 


ccc 


gag 


ggt 


cac 


met his 


val 


thr 


asp 


pro 


asp 


gly 


ile 


thr 


leu ile 


phe 


val 


glu 


val 


pro 


glu 


gly 


his 


361/121 




































ccg ctg 


cgt 


aca 


gac 


acc 


egg 


gcg 


tga 






















pro leu 


arg 


thr 


asp 


thr 


arg 


ala 


)OPA 























SEQ ZD NOS: 209-210 

FIG. 11D 



ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv0546c 
1/1__^-SEQ ID NO: 211 31/11 

tag tea ggg cgt gca ttc gac gac get gta eta ccc get ggt ggc aac tec gat gat tgc 

AMB ( ser gly arg ala phe asp asp ala val leu pro ala gly gly asn ser asp asp cys 
61/21^SEQ ID NO: 212 91/31 

gec gac gaa ggc eta cga egg get gee ggc get gac cgc cgc gga age cgc cga gtg gat 

ala asp glu gly leu arg arg ala ala gly ala asp arg arg gly ser arg arg val asp 
121/41 151/51 

ggt cac cgc cgc ccg cac ccg acc ggt gcg gat cgc gee teg ggt tgc cgt cgc cgt caa 

gly his arg arg pro his pro thr gly ala asp arg ala ser gly cys arg arg arg gin 
181/61 211/71 

cgc get gga cag cat egg tec ccg ctg ggt caa tgc act cat gca gcg ccg caa cga aca 

arg ala gly gin his arg ser pro leu gly gin cys thr his ala ala pro gin arg thr 
241/81 271/91 

get caa ccc ttg aac egg gtc ccg gee tgc cga ccc teg gee gee ggc gtg ccg eta cgt 

ala gin pro leu asn arg val pro ala cys arg pro ser ala ala gly val pro leu arg 
301/101 331/111 

gat aga cac agg gee atg gaa ate ctg gee age egg atg eta ctt egg ccg gcg gac tat 

asp arg his arg ala met glu ile leu ala ser arg met leu leu arg pro ala asp tyr 
361/121 391/131 

cag egg teg ctg age ttc tac cgt gac cag ate ggg ctg gcg att gee cgt gaa tac ggg 

gin arg ser leu ser phe tyr arg asp gin ile gly leu ala ile ala arg glu tyr gly 
421/141 451/151 

gee ggc aca gtg ttt ttc gee ggt cag tea ctg etc gaa ctg gee ggt tac ggc gag ccg 

ala gly thr val phe phe ala gly gin ser leu leu glu leu ala gly tyr gly glu pro 
481/161 511/171 

gac cat teg egg gga cct ttt ccc ggc gcg ctg tgg ctg cag gtg cgc gac etc gag get 

asp his ser arg gly pro phe pro gly ala leu trp leu gin val arg asp leu glu ala 
541/181 571/191 

acc cag acc gag ctg gtc age cga ggc gtg teg ate get cgc gag ccc cgc cgc gaa ccg 

thr gin thr glu leu val ser arg gly val ser ile ala arg glu pro arg arg glu pro 
601/201 631/211 

tgg ggc ctg cac gag atg cat gtg acc gac cca gac ggg ate aca ctg ata ttc gtc gag 

trp gly leu his glu met his val thr asp pro asp gly ile thr leu ile phe val glu 

661/221 691/231 

gtt ccc gag ggt cac ccg ctg cgt aca gac acc egg gcg tga 

val pro glu gly his pro leu arg thr asp thr arg ala)OPA 



SEQ ID NOS: 211-212 



FIG. 11E 



CP 1 



1/1^—SEQ ID NO: 213 
gac cga agg gat ttc gcg act 
( asp ar g arg asp phe ala thr 
61/21 SEQ ID NO: 214 

gac gta gac agg aag aga cag 
asp val asp arg lys arg gin 
121/41 

gag tct ttc cga gtt cag caa 
glu ser phe arg val gin gin 
181/61 

gcg gcc egg gec get teg ggc 
ala ala arg ala ala ser gly 
241/81 

gac aac cgc gcg gaa ttc aat 
asp asn arg ala glu phe asn 
301/101 

aac etc egg agt gat c 
asn leu arg ser asp) 



aac teg gcc 
asn ser ala 

gga age tga 
gly ser)OPA 

caa teg aca 
gin ser thr 

cga gtg tct 
arg val ser 

egg atg gcg 
arg met ala 



31/11 
tgt aag 
cys lys 
91/31 
tga cgt 
OPA(arg 
151/51^ 
cag aag 
gin lys 
211/71 - 
gag taa 
glu)OCH< 
271/91 
ggc ggg 
gly gly 



gca acg cga 
ala thr arg 

cgc gta ccg 
arg val pro 
—SEQ ID NO: 

egg gga cca 
arg gly pro 

gac cag agt 
asp gin ser 
^SEQ ID NO J 
ace gga ttg 
thr gly leu 



ggt ctt cat 
gly leu his 

gac cgc cat 
asp arg his 

215 
gac egg gag 
asp arg glu 

cac ggg tec 
his gly ser 

216 
cgc egg tea 
arg arg ser 



gcc gag 
ala glu 

tct gtc 
ser val 

gac gac 
asp asp 

gtg tgt 
val cys 

ccg agg 
pro arg 



SEQ ID NOS: 213-216 



FIG. 12A 



1/_1^--SEQ ID NO: 217 




ace gaa 


ggg 


att 


teg 


cga 


eta 


(thr glu 


gly 


ile 


ser 


arg 


leu 


61/21^- 


SEQ 


ID NO: 


218 




acg tag 


aca 


gga 


aga 


gac 


agg 


thr)AMB(thr 


gly 


arg 


asp 


arg 


121/41 




SEQ 


ID NO: 


219 


agt ctt 


tec 


gag 


ttc 


age 


aac 


ser leu 


ser 


glu 


phe 


ser 


asn 


181/61 












egg ccc 


ggg 


ccg 


ctt 


egg 


gcc 


arg pro 


gly 


pro 


leu 


arg 


ala 


241/81 












aca acc 


gcg 


egg 


aat 


tea 


ate 


thr thr 


ala 


arg 


asn 


ser 


ile 


301/101 












acc tec 


gga 


gtg 


ate 






thr ser 


gly 


val 


ile) 







31/11 

act egg cct gta agg caa cgc 
thr arg pro val arg gin arg 
- 91/31 

gaa get gat gac gtc gcg tac 
glu ala asp asp val ala tyr 
151/51 

aat cga cac aga age ggg gac 
asn arg his arg ser gly asp 
211/71 

gag tgt ctg agt aag acc aga 
glu cys leu ser lys thr arg 
271/91 

gga tgg egg gcg gga ccg gat 
gly trp arg ala gly pro asp 



gag gtc ttc atg ccg agg 
glu val phe met pro arg 

egg acc gcc att ctg teg 
arg thr ala ile leu ser 

cag acc ggg agg acg acg 
gin thr gly arg thr thr 

gtc acg ggt ccg tgt gtg 
val thr gly pro cys val 

tgc gcc ggt cac cga gga 
cys ala gly his arg gly 



SEQ XD NOS: 217-219 



FIG. 12B 



MAR 2 6 2003 S] 



1/1 SEQ ID NO: 220 31/11 

ccg aag gga ttt cgc gac taa etc ggc ctg taa ggc aac gcg agg tct tea tgc cga gga 
(p ro ly s gly phe arg asp)OCH leu gly leu OCH(gly asn ala arg ser ser cys arg gly 

6 1/ 2 1"*- — SEQ ID MO: 221 91/3l"* SEQ ID NO: 222 

cgt aga cag gaa gag aca ggg aag ctg atg acg teg cgt ace gga ccg cca ttc tgt cga 
arg arg gin glu glu thr gly lys leu met thr ser arg thr gly pro pro phe cys arg 
121/41 151/51 

gtc ttt ccg agt tea gca aca ate gac aca gaa gcg ggg acc aga ccg gga gga cga cgc 
val phe pro ser ser ala thr ile asp thr glu ala gly thr arg pro gly gly arg arg 
181/61 211/71 

ggc ccg ggc cgc ttc ggg ccg agt gtc tga gta aga cca gag tea egg gtc cgt gtg tga 
gly pro gly arg phe gly pro ser val)OPA(val arg pro glu ser arg val arg val)OPA 
241/81 SEQ ID NO: 223 ^271/91 

caa ccg cgc gga att caa teg gat ggc ggg egg gac egg att gcg ccg gtc acc gag gaa 
(gJLn pro arg gly ile gin ser asp gly gly arg asp arg ile ala pro val thr glu glu 
301/lOT — -SEQ ID NO: 224 
cct ccg gag tga tc 
pro pro glu)OPA 



SEQ ZD NOS: 220-224 

FIG. 12C 



1/1^ SEQ ID NO: 225 31/11 



GGG ATT 


TCG 


TTG 


CCC 


GAT 


GGA 


TTG 


TTT 


GTA 


CGG TTT 


GGG 


AAA 


AAC 


ACT 


TGA 


AGT 


CCT 


TTT 


(gly ile 


ser 


leu 


pro 


asp 


gly 


leu 


phe 


val 


arg phe 


gly 


lys 


asn 


thr 


) OPA( ser 


pro 


phe 


6T73T "—SEQ ID 


NO: 


226 










91/31 


SEQ 


ID NO: 


227- 










TAT TGG 


CAA 


TGC 


TGG 


AAA 


TGG 


ACA 


TTC 


CAA 


TAT TGC 


GCG 


AAT 


TAA 


CCG 


AAC 


ACG 


GTG 


AGG 


tyr trp 


gin 


cys 


trp 


lys 


trp 


thr 


phe 


gin 


tyr cys 


ala 


asn)OCH(pro 


asn 


thr 


val 


arg 


121/41 


















151/51 








V SEQ ID 


NO: 


923 


GGG GGG 


CAA 


GCG 


TTT 


GTA 


CCG 


GGG 


CCA 


GCA 


AGC GCC 


GCC 


GAC 


CGG 


TTG 


ACC 


GAA 


GCC 


AGC 


gly gly 


gin 


ala 


phe 


val 


pro 


gly 


pro 


ala 


ser ala 


ala 


asp 


arg 


leu 


thr 


glu 


ala 


ser 


181/61 


















211/71 


















ATG TTG 


TTG 


TGT 


CAG 


CGC 


GGG 


CTT 


GGT 


CTC 


GAT GTC 


CCG 


GCC 


TTG 


GCT 


GGA 


CCC 


GCT 


TCT 


met leu 


leu 


cys 


gin 


arg 


gly 


leu 


gly 


leu 


asp val 


pro 


ala 


leu 


ala 


gly 


pro 


ala 


ser 


241/81 


















271/91 


















TCA AAA 


CAG 


GTT 


GAA 


CTT 


AAC 


GAC 


TCA 


AGA 


ACG GAA 


ACG 


CTT 


GAA 


CCG 


CGA 


CGT 


CGC 


TCC 


ser lys 


gin 


val 


glu 


leu 


asn 


asp 


ser 


arg 


thr glu 


thr 


leu 


glu 


pro 


arg 


arg 


arg 


ser 


301/101 


















331/111 


















GGA CAC 


CAA 


TTT 


GAC 


TCG 


GCT 


CTT 


TGG 


CAA 


TTG AAG 


GTG 


AGC 


TGC 


GAG 


CAG 


CCG 


GGT 


GAC 


gly his 


gin 


phe 


asp 


ser 


ala 


leu 


trp 


gin 


leu lys 


val 


ser 


cys 


glu 


gin 


pro 


gly 


asp 


361/121 


















391/131 


















CGC ATC 


GTT 


GGC 


CTT 


GCC 


ATC 


AAT 


CGC 


CGG 


CTC GCG 


GAC 


GTA 


GAT 


AAT 


CAG 


CTC 


ACC 


GTT 


arg ile 


val 


gly 


leu 


ala 


ile 


asn 


arg 


arg 


leu ala 


asp 


val 


asp 


asn 


gin 


leu 


thr 


val 


421/141 


















451/151 


















GGG ACC 


GAC 


CTC 


GAC 


CAG 


GGG 


TCC 


TTT 


GTG 


ACT GCC 


GGG 


CTT 


GAC 


GCG 


GAC 


GAC 


CAC 


AGA 


gly thr 


asp 


leu 


asp 


gin 


gly 


ser 


phe 


val 


thr ala 


gly 


leu 


asp 


ala 


asp 


asp 


his 


arg 


481/161 


















511/171 


















GTC GGT 


CAT 


CGC 


CTA 


AGG 


CTA 


CCG 


TTC 


TGA 


CCT GGG 


GCT 


GCG 


TGG 


GCG 


CCG 


ACG 


ACG 


TGA 


val gly 


his 


arg 


leu 


arg 


leu 


pro 


phe)OPA(pro gly 


ala 


ala 


trp 


ala 


pro 


thr 


thr)OPA 


541/181 








SEQ ID NOi 


! 924-*^ 


571/191 


















GGC ACG 


TCA 


TGT 


CTC 


AGC 


GGC 


CCA 


CCG 


CCA 


CCT CGG 


TCG 


CCG 


GCA 


GTA 


TGT 


CAG 


CAT 


GTG 


(gly thr 
6U1/201 


ser 


cys 


leu 


ser 


gly 


pro 


pro 


pro 


pro arg 


ser 


pro 


ala 


val 


cys 


gin 


his 


val 


^SEQ ID 


NO: 


925 








631/211 


















CAG ATG 


ACT 


CCA 


CGC 


AGC 


CTT 


GTT 


CGC 


ATC 


GTT GGT 


GTC 


GTG 


GTT 


GCG 


ACG 


ACC 


TTG 


GCG 


gin met 


thr 


pro 


arg 


ser 


leu 


val 


arg 


ile 


val gly 


val 


val 


val 


ala 


thr 


thr 


leu 


ala 


661/221 


















691/231 


















CTG GTG 


AGC 


GCA 


CCC 


GCC 


GGC 


GGT 


CGT 


GCC 


GCG CAT 


GCG 


GAT 


C 












leu val 


ser 


ala 


pro 


ala 


gly 


gly 


arg 


ala 


ala his 


ala 


asp) 













SEQ ID NOS:220-227, 923-925 



FIG. 13A 



SEQ 


ID NO: 


228 












32/11 


















GGA TTT 


CGT 


TGC 


CCG 


ATG 


GAT 


TGT 


TTG 


TAC 


GGT TTG 


GGA 


AAA 


ACA 


CTT 


GAA 


GTC 


CTT 


TTT 




arg 


cys 


pro 


met 


asp 


cys 


leu 


tyr 


gly leu 


gly 


lys 


thr 


leu 


glu 


val 


leu 


phe 


62/21 


-SEQ ID 


NO: 


229 








92/31 


















ATT GGC 


AAT 


GCT 


GGA 


AAT 


GGA 


CAT 


TCC 


AAT 


ATT GCG 


CGA 


ATT 


AAC 


CGA 


ACA 


CGG 


TGA 


GGG 


ile gly 


asn 


ala 


gly 


asn 


gly 


his 


ser 


asn 


ile ala 


arg 


ile 


asn 


arg 


thr 


arg)OPA(gly 


122/41 


















152/51 






SEQ ID NO: 


230—^ 




GGG GGC 


AAG 


CGT 


TTG 


TAC 


CGG 


GGC 


CAG 


CAA 


GCG CCG 


CCG 


ACC 


GGT 


TGA 


CCG 


AAG 


CCA 


GCA 


gly gly 


lys 


arg 


leu 


tyr 


arg 


gly 


gin 


gin 


ala pro 


pro 


thr 


gly) OPA(pro 


lys 


pro 


ala 


182/61 


















212/71 


SEQ ID 


NO: 


231 y 








TGT TGT 


TGT 


GTC 


AGC 


GCG 


GGC 


TTG 


GTC 


TCG 


ATG TCC 


CGG 


CCT 


TGG 


CTG 


GAC 


CCG 


CTT 


CTT 


cys cys 


cys 


val 


ser 


ala 


gly 


leu 


val 


ser 


met ser 


arg 


pro 


trp 


leu 


asp 


pro 


leu 


leu 


242/81 


















272/91 


















CAA AAC 


AGG 


TTG 


AAC 


TTA 


ACG 


ACT 


CAA 


GAA 


CGG AAA 


CGC 


TTG 


AAC 


CGC 


GAC 


GTC 


GCT 


CCG 


gin asn 


arg 


leu 


asn 


leu 


thr 


thr 


gin 


glu 


arg lys 


arg 


leu 


asn 


arg 


asp 


val 


ala 


pro 


302/101 


















332/111 


















GAC ACC 


AAT 


TTG 


ACT 


CGG 


CTC 


TTT 


GGC 


AAT 


TGA AGG 


TGA 


GCT 


GCG 


AGC 


AGC 


CGG 


GTG 


ACC 


asp thr 


asn 


leu 


thr 


arg 


leu 


phe 


gly 


asn)OPA arg 


OPA(ala 


ala 


ser 


ser 


arg 


val 


thr 


362/121 


















392/131 






SEQ ID NO : 


232 




GCA TCG 


TTG 


GCC 


TTG 


CCA 


TCA 


ATC 


GCC 


GGC 


TCG CGG 


ACG 


TAG 


ATA 


ATC 


AGC 


TCA 


CCG 


TTG 


ala ser 


leu 


ala 


leu 


pro 


ser 


ile 


ala 


gly 


ser arg 


thr)AMB(ile 


ile 


ser 


ser 


pro 


leu 


422/141 


















452/151 






SEQ ID NO: 233 




GGA CCG 


ACC 


TCG 


ACC 


AGG 


GGT 


CCT 


TTG 


TGA 


CTG CCG 


GGC 


TTG 


ACG 


CGG 


ACG 


ACC 


ACA 


GAG 


gly pro 


thr 


ser 


thr 


arg 


gly 


pro 


leu)OPA(leu pro 


gly 


leu 


thr 


arg 


thr 


thr 


thr 


glu 


482/161 








SEQ ID NO 


: 234—*" 


512/171 


















TCG GTC 


ATC 


GCC 


TAA 


GGC 


TAC 


CGT 


TCT 


GAC 


CTG GGG 


CTG 


CGT 


GGG 


CGC 


CGA 


CGA 


CGT 


GAG 


ser val 


ile 


ala)OCH(gly 


tyr 


arg 


ser 


asp 


leu gly 


leu 


arg 


gly 


arg 


arg 


arg 


arg 


glu 


542/181 








^SEQ ID NO 


: 235 


572/191 


















GCA CGT 


CAT 


GTC 


TCA 


GCG 


GCC 


CAC 


CGC 


CAC 


CTC GGT 


CGC 


CGG 


CAG 


TAT 


GTC 


AGC 


ATG 


TGC 


ala arg 


his 


val 


ser 


ala 


ala 


his 


arg 


his 


leu gly 


arg 


arg 


gin 


tyr 


val 


ser 


met 


cys 


602/201 


















632/211 


















AGA TGA 


CTC 


CAC 


GCA 


GCC 


TTG 


TTC 


GCA 


TCG 


TTG GTG 


TCG 


TGG 


TTG 


CGA 


CGA 


CCT 


TGG 


CGC 


arg) OPA( leu 


his 


ala 


ala 


leu 


phe 


ala 


ser 


leu val 


ser 


trp 


leu 


arg 


arg 


pro 


trp 


arg 


662/221 


^SEQ ID NO: 


236 








692/231 


















TGG TGA 


GCG 


CAC 


CCG 


CCG 


GCG 


GTC 


GTG 


CCG 


CGC ATG 


CGG 


ATC 















trp)OPA(ala his pro pro ala val val pro arg met arg ile) 
^SEQ ID NO: 237 



SEQ ID NOS: 228-237 



FIG. 13B 



SEQ ID NO: 238 33/11 



GAT TTC 


GTT 


GCC 


CGA 


TGG 


ATT 


GTT 


TGT 


ACG 


GTT TGG 


GAA 


AAA 


CAC 


TTG 


AAG 


TCC 


TTT 


TTA 


(aspphe 


val 


ala 


arg 


trp 


ile 


val 


cys 


thr 


val trp 


glu 


lvs 

Jr 


his 


leu 


lvs 


ser 


phe 


leu 


63/21 — - 


SEQ ID NO: 


239 








93/31 


















TTG GCA 


ATG 


CTG 


GAA 


ATG 


GAC 


ATT 


CCA 


ATA 


TTG CGC 


GAA 


TTA 


ACC 


GAA 




VjVJ J. 


GAG 


GGG 


leu ala 


met 


leu 


glu 


met 


asp 


ile 


pro 


ile 


leu arg 


glu 


leu 


thr 


glu 


his 




glu 




123/41 


















153 /51 


















GGG GCA 


AGC 


GTT 


TGT 


ACC 


GGG 


GCC 


AGC 


AAG 


CGC CGC 


CGA 


CCG 


GTT 


GAC 


CGA 


AGC 


CAG 


CAT 


crlv ala 


ser 


val 


cys 


thr 


gly 


ala 


ser 


lvs 


arg arg 


arg 


pro 


val 


asp 


arg 


ser 


gin 


his 


183/61 


















213 /71 


















GTT GTT 


GTG 


TCA 


GCG 


CGG 


GCT 


TGG 


TCT 


CGA 


TGT CCC 


GGC 


CTT 


GGC 


TGG 


ACC 


CGC 


TTC 


TTC 


va 1 va 1 


val 


ser 


ala 


arg 


ala 


trp 


ser 


arg 


cvs nro 


gly 


leu 


gly 


trp 


thr 


arg 


phe 


phe 


243/81 


















273/91 


















AAA ACA 


GGT 


TGA 


ACT 


TAA 


CGA 


CTC 


AAG 


AAC 


GGA AAC 


GCT 


TGA 


ACC 


GCG 


ACG 


TCG 


CTC 


CGG 


lys thr 


gly)OPA 


thr 


OCH(arg . 


leu 


lys 


asn 


gly asn 


ala)OPA(thr 


ala 


thr 


ser 


leu 


arg 


303/101 


SEQ 


ID NO: 


240^ 








333/111 






^-SEQ ID NO: 241 




ACA CCA 


ATT 


TGA 


CTC 


GGC 


TCT 


TTG 


GCA 


ATT 


GAA GGT 


GAG 


CTG 


CGA 


GCA 


GCC 


GGG 


TGA 


CCG 


thr pro 


ile)OPA(leu 


gly 


ser 


leu 


ala 


ile 


glu gly 


glu 


leu 


arg 


ala 


ala 


gly)OPA(pro 


363/121 






^-SEQ ID NO: 


242 




393/131 






SEQ ID 


NO: 


243 ^ 




CAT CGT 


TGG 


CCT 


TGC 


CAT 


CAA 


TCG 


CCG 


GCT 


CGC GGA 


CGT 


AGA 


TAA 


TCA 


GCT 


CAC 


CGT 


TGG 


his arg 


trp 


pro 


cys 


his 


gin 


ser 


pro 


ala 


arg gly 


arg 


arg)OCH(ser 


ala 


his 


arg 


trp 


423/141 


















453/151 








^-SEQ ID NO: 244 


GAC CGA 


CCT 


CGA 


CCA 


GGG 


GTC 


CTT 


TGT 


GAC 


TGC CGG 


GCT 


TGA 


CGC 


GGA 


CGA 


CCA 


CAG 


AGT 


asp arg 


pro 


arg 


pro 


gly 


val 


leu 


cys 


asp 


cys arg 


ala)OPA(arg 


gly 


arg 


pro 


gin 


ser 


483/161 


















513/171 






SEQ ID NO: 245 




CGG TCA 


TCG 


CCT 


AAG 


GCT 


ACC 


GTT 


CTG 


ACC 


TGG GGC 


TGC 


GTG 


GGC 


GCC 


GAC 


GAC 


GTG 


AGG 


arg ser 


ser 


pro 


lys 


ala 


thr 


val 


leu 


thr 


trp gly 


cys 


val 


gly 


ala 


asp 


asp 


val 


arg 


543/181 


















573/191 


















CAC GTC 


ATG 


TCT 


CAG 


CGG 


CCC 


ACC 


GCC 


ACC 


TCG GTC 


GCC 


GGC 


AGT 


ATG 


TCA 


GCA 


TGT 


GCA 


his val 


met 


ser 


gin 


arg 


pro 


thr 


ala 


thr 


ser val 


ala 


gly 


ser 


met 


ser 


ala 


cys 


ala 


603/201 


















633/211 


















GAT GAC 


TCC 


ACG 


CAG 


CCT 


TGT 


TCG 


CAT 


CGT 


TGG TGT 


CGT 


GGT 


TGC 


GAC 


GAC 


CTT 


GGC 


GCT 


asp asp 


ser 


thr 


gin 


pro 


cys 


ser 


his 


arg 


trp cys 


arg 


gly 


cys 


asp 


asp 


leu 


gly 


ala 


663/221 


















693/231 


















GGT GAG 


CGC 


ACC 


CGC 


CGG 


CGG 


TCG 


TGC 


CGC 


GCA TGC 


GGA 


TC 














gly glu 


arg 


thr 


arg 


arg 


arg 


ser 


cys 


arg 


ala cys 


gly) 















SEQ ID NOS: 238-245 



FIG. 13C 



of the nucleotide sequence of seql3A 
1^/1_^-SEQ ID NO: 246 
GGG TCC TTT GTG ACT GCC GGG CTT 
( gly se r phe val thr ala gly leu 
61/21 ""SEQ ID NOt 247 
CTA CCG TTC TGA CCT GGG GCT GCG 
leu pro phe)OPA(pro gly ala ala 
121/41 ^SEQ ID NO: 24 

GGC CCA CCG CCA CCT CGG TCG CCG 
gly pro pro pro pro arg ser pro 
181/61 

CTT GTT CGC ATC GTT GGT GTC GTG 
leu val arg ile val gly val val 
241/81 

GGC GGT CGT GCC GCG CAT GCG GAT 
gly gly arg ala ala his ala asp) 



asp 



trp ala 
18 



val ala 



31/11 


















GAC GAC 


CAC 


AGA 


GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


asp asp 


his 


arg 


val 


gly 


his 


arg 


leu 


arg 


91/31 


















CCG ACG 


ACG 


TGA 


GGC 


ACG 


TCA 


TGT 


CTC 


AGC 


pro thr 


thr)OPA(gly 


thr 


ser 


cys 


leu 


ser 


151/51 






V SEQ ID 


NO: 


249 




TGT CAG 


CAT 


GTG 


CAG 


ATG 


ACT 


CCA 


CGC 


AGC 


cys gin 


his 


val 


gin 


met 


thr 


pro 


arg 


ser 


211/71 


















ACG ACC 


TTG 


GCG 


CTG 


GTG 


AGC 


GCA 


CCC 


GCC 


thr thr 


leu 


ala 


leu 


val 


ser 


ala 


pro 


ala 



SEQ XD NOS: 246-249 



FIG. 13A' 



l/l/SEQ ID 


NO: 


250 










31/11 










GGT CCT 


TTG 


TGA 


CTG 


CCG 


GGC 


TTG 


ACG 


CGG 


ACG ACC 


ACA 


GAG 


TCG GTC ATC 


GCC TAA GGC 


gly pro (leu 


OPA 


leu 


pro 


gly 


leu 


thr 


arg 


thr thr 


thr 


glu 


ser val ile 


ala)OCH(gly 


61/21 


<*SEQ 


ID NO: 


251 










91/31 






SEQ ID NO: 252"^ 


TAC CGT 


TCT 


GAC 


CTG 


GGG 


CTG 


CGT 


GGG 


CGC 


CGA CGA 


CGT 


GAG 


GCA CGT CAT 


GTC TCA GCG 


tyr arg 


ser 


asp 


leu 


gly 


leu 


arg 


gly 


arg 


arg arg 


arg 


glu 


ala arg his 


val ser ala 


121/41 


















151/51 










GCC CAC 


CGC 


CAC 


CTC 


GGT 


CGC 


CGG 


CAG 


TAT 


GTC AGC 


ATG 


TGC 


AGA TGA CTC 


CAC GCA GCC 


ala his 


arg 


his 


leu 


gly 


arg 


arg 


gin 


tyr 


val ser 


met 


cys 


arg) OPA (leu 


his ala ala 


181/61 


















211/71 






*SEQ 


ID NO: 253 


TTG TTC 


GCA 


TCG 


TTG 


GTG 


TCG 


TGG 


TTG 


CGA 


CGA CCT 


TGG 


CGC 


TGG TGA GCG 


CAC CCG CCG 


leu phe 


ala 


ser 


leu 


val 


ser 


trp 


leu 


arg 


arg pro 


trp 


arg 


trp)OPA(ala 


his pro pro 


241/81 


















SEQ ID 


NO: 


254-— ^ 




GCG GTC 


GTG 


CCG 


CGC 


ATG 


CGG 


ATC 
















ala val 


val 


pro 


arg 


met 


arg 


ile) 

















SEQ JD NOS: 250-254 



FIG. 13B' 

1/1 ^SEQ ID NO: 255 31/11 



GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT 


TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


(val leu 


cys 


asp 


cys 


arg 


ala)OPA(arg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


61/21^ 


— SEQ ID 


NO: 


256 








91/31 ^ 


SEQ 


ID NO: 


257 










ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC 


GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


CGG 


thr val 


leu 


thr 


trp. 


gly 


cys 


val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


121/41 


















151/51 


















CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC 


GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


CAG 


CCT 


pro thr 


ala 


thr 


ser 


val 


ala 


gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin 


pro 


181/61 


















211/71 


















TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT 


GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


CGG 


cys ser 


his 


arg 


trp 


cys 


arg 


gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu 


arg 


thr 


arg 


arg 



241/81 

CGG TCG TGC CGC GCA TGC GGA TC 
arg ser cys arg ala cys gly) 



SEQ XD NOS: 255-257 

FIG. 13C" 



MAR 2 6 2MB cn 



ence Rvl984c predicted by Cole et al . (Nature 393:537-544) and 
containing seql3A' 



l/l^^-SEQ ID NO: 258 










31/11 


















atg act 


cca 


cgc 


age 


ctt 


gtt 


cgc 


ate 


gtt 


ggt gtc 


gtg 


gtt 


qca 

23 w 23 


acg 


acc 


tta 


aca 


ctg 


(Met thr 


pro 


arg 


ser 


leu 


val 


arg 


ile 


val 


gly val 


val 


val 


ala 


thr 


thr 


leu 


ala 


leu 


61/21^SEQ ID NO: 259 










91/31 


















gtg age 


gca 


cec 


gee 


y y v- 


exert 

23 23 w 


cat 


QCC 

j3 w w 


crccr 

33 ^23 


cat gcg 


gat 


ccg 


tgt 


teg 


gac 


ate 


dCZd 

y *--y 


gtc 


val ser 


ala 


oro 


ala 


cr lv 


crl v 


arcr 


ala 


ala 


his ala 


asp 


pro 


cys 


ser 


asp 


ile 


ala 


va 1 


121/41 


















151/51 


















afct ttc 


gc t 


cgc 


y y*- 


acg 


cat 


cag 


get 


tct 


ggt ctt 


crcrc 

23 23 w 


gac 


ate 


aat 


aaa 

y cay 


aca 
y *-y 


ttc 


gtc 


val phe 


ala 


arg 


a lv 


thr 


his 


gin 


ala 


ser 


gly leu 


crlv 
y J 


asp 


val 


y J-y 


glu 


ala 


phe 


va 1 


181/61 


















211/71 


















yciv* y 


ct t 


acc 




caa 


gtt 




crcrcr 
yyy 


CCfCf 


tea att 


add 
yyy 


gtc 


tac * 


y *-y 


y *-y 


aac 


tac 


cca 


asp ser 


leu 


thr 


s er 


gin 


val 


crlv 


CT lv 


arg 


ser ile 


crlv 

y -J: 


val 




ala 


val 


asn 


t~ \/'T~ 


pro 


241/81 


















271/91 


















aca acre 


gac 


gac 


tac 


cgc 


crccr 


age 


aca 


tea 


aac ggt 


tec 


gat 


gat 


y y 


aac 
Q-y v_. 


gec 


cac 


ate 


ala ser 


asp 


asp 




arg 


ala 


ser 


ala 


ser 


asn alv 


ser 


asp 


asp 


ala 


ser 


ala 


his 


ile 


301/101 


















331/111 


















cag cgc 


acc 


gtc 


gec 


age 


tgc 


ccg 


aac 


acc 


acrcr att 


atcr 
y *-y 


ctt 


aat - 


yy^- 


tat 


ut-y 




yy *- 


gin arg 


thr 


val 


ala 


ser 


cys 


pro 


asn 


thr 


arg ile 


val 


leu 


alv 

y Jr 


al v 




ser 


gin 


y - 1 - y 


361/121 


















391/131 


















gcg acg 


gtc 


ate 


gat 


ttg 


tec 


acc 


tCQ 


crccr 

23 w 23 


atcr cccr 


cec 


oca 

23 > — 23 


ata 

23 *-23 


gca 


ga t 


cat 


ate 


gee 


ala thr 


val 


ile 


asp 


leu 


ser 


thr 


ser 


ala 


met pro 


pro 


ala 


val 


ala 


asp 


his 


val 


ala 


421/141 


















451/151 


















get gtc 


gec 


ctt 


ttc 


ggc 


gag 


cca 


tec 


agt 


ggt ttc 


tec 


age 


atg 


ttg 


tgg 


ggc 


ggc 


ggg 


ala val 


ala 


leu 


phe 


gly 


glu 


pro 


ser 


ser 


gly phe 


ser 


ser 


met 


leu 


trp 


gly 


gly 


gly 


481/161 


















511/171 


















teg ttg 


ccg 


aca 


ate 


ggt 


ccg 


ctg 


tat 


age 


tct aag 


acc 


ata 


aac 


ttg 


tgt 


get 


cec 


gac 


ser leu 


pro 


thr 


ile 


gly 


pro 


leu 


tyr 


ser 


ser lys 


thr 


ile 


asn 


leu 


cys 


ala 


pro 


asp 


541/181 


















571/191 


















gat cca 


ata 


tgc 


acc 


gga 


ggc 


ggc 


aat 


att 


atg gcg 


cat 


gtt 


teg 


tat 


gtt 


cag 


teg 


ggg 


asp pro 


ile 


cys 


thr 


gly 


gly 


gly 


asn 


ile 


met ala 


his 


val 


ser 


tyr 


val 


gin 


ser 


gly 


601/201 


















631/211 


















atg aca 


age 


cag 


gcg 


gcg 


aca 


ttc 


gcg 


gcg 


aac agg 


etc 


gat 


cac 


gee 


gga 


tga 






met thr 


ser 


gin 


ala 


ala 


thr 


phe 


ala 


ala 


asn arg 


leu 


asp 


his 


ala 


gly)OPA 







SEQ ID NOS: 258-259 



FIG. 13D 



Seql3F: ORF predicted by Cole et al . (Nature 393:537-544) and containing 
Rvl984c 



1/1_^-SEQ ID NO: 260 31/11 



tga ggc 
OPA(gly 
61/21* — 


acg 
thr 
SEQ 


tea tgt 
ser cys 
ID NO: 


etc 
leu 
261 


age 
ser 


ggc 
gly 


cca 
pro 


ccg 
pro 


cca cct 
pro pro 
91/31 


egg 
arg 


teg 
ser 


ccg 
pro 


gca 
ala 


gta 
val 


tgt 
cys 


cag 
gin 


cat 
his 


gtg cag 
val gin 
121/41 


atg 
met 


act 
thr 


cca 
pro 


cgc 
arg 


age 
ser 


ctt 
leu 


gtt 

val 


cgc 
arg 


ate gtt 
ile val 
151/51 


ggt 
gly 


gtc 
val 


gtg 
val 


gtt 

val 


gcg 
ala 


acg 
thr 


acc 
thr 


ttg 
leu 


gcg ctg 
ala leu 
181/61 


gtg 
val 


age 
ser 


gca 
ala 


ccc 
pro 


gcc 
ala 


ggc 
gly 


ggt 
gly 


cgt 
arg 


gcc gcg 
ala ala 
211/71 


cat 
his 


gcg 
ala 


gat 
asp 


ccg 
pro 


tgt 
cys 


teg 
ser 


gac 
asp 


ate 
ile 


gcg gtc 
ala val 
241/81 


gtt 
val 


ttc 
phe 


get 
ala 


cgc 
arg 


ggc 

gly 


acg 
thr 


cat 
his 


cag 
gin 


get tct 
ala ser 
271/91 


ggt 
gly 


ctt 
leu 


ggc 

gly 


gac 
asp 


gtc 
val 


ggt 

gly 


gag 
glu 


gcg 
ala 


ttc gtc 
phe val 
301/101 


gac 
asp 


teg 
ser 


ctt 
leu 


ace 
thr 


teg 
ser 


caa 
gin 


gtt 

val 


ggc 
gly 


ggg egg 

gly arg 
331/111 


teg 
ser 


att 
ile 


ggg 
gly 


gtc 
val 


tac 
tyr 


gcg 

ala 


gtg 
val 


aac 
asn 


tac cca 
tyr pro 
361/121 


gca 
ala 


age 
ser 


gac 
asp 


gac 
asp 


tac 
tyr 


cgc 
arg 


gcg 
ala 


age 
ser 


gcg tea 
ala ser 
391/131 


aac 
asn 


ggt 
gly 


tec 
ser 


gat 
asp 


gat 
asp 


gcg 
ala 


age 
ser 


gcc 
ala 


cac ate 
his ile 
421/141 


cag 
gin 


cgc 
arg 


ace 
thr 


gtc 
val 


gcc 
ala 


age 
ser 


tgc 
cys 


ccg 
pro 


aac acc 
asn thr 
451/151 


agg 
arg 


att 
ile 


gtg 

val 


ctt 
leu 


ggt 
gly 


ggc 
gly 


tat 
tyr 


teg 
ser 


cag ggt 
gin gly 
481/161 


gcg 
ala 


acg 
thr 


gtc 
val 


ate 
ile 


gat 
asp 


ttg 
leu 


tec 
ser 


acc 
thr 


teg gcg 
ser ala 
511/171 


atg 
met 


ccg 
pro 


ccc 
pro 


gcg 

ala 


gtg 
val 


gca 
ala 


gat 
asp 


cat 
his 


gtc gcc 
val ala 
541/181 


get 
ala 


gtc 
val 


gcc 
ala 


ctt 
leu 


ttc 
phe 


ggc 
gly 


gag 
glu 


cca 
pro 


tec agt 
ser ser 
571/191 


ggt 
gly 


ttc 
phe 


tec 
ser 


age 
ser 


atg 
met 


ttg 
leu 


tgg 
trp 


ggc 
gly 


ggc ggg 

gly gly 

601/201 


teg 
ser 


ttg 
leu 


ccg 
pro 


aca 
thr 


ate 
ile 


ggt 

gly 


ccg 
pro 


ctg 
leu 


tat age 
tyr ser 
631/211 


tct 
ser 


aag 
lys 


acc 
thr 


ata 
ile 


aac 
asn 


ttg 
leu 


tgt 
cys 


get 
ala 


ccc gac 
pro asp 
661/221 


gat 
asp 


cca 
pro 


ata 
ile 


tgc 
cys 


acc 
thr 


gga 
gly 


ggc 

gly 


ggc 

gly 


aat att 
asn ile 
691/231 


atg 
met 


gcg 
ala 


cat 
his 


gtt 
val 


teg 
ser 


tat 
tyr 


gtt 
val 


cag 
gin 


teg ggg 
ser gly 


atg 
met 


aca 
thr 


age 
ser 


cag 
gin 


gcg 
ala 


gcg 
ala 


aca 
thr 


ttc 
phe 


gcg gcg 
ala ala 


aac 
asn 


agg 
arg 


etc 
leu 


gat 
asp 


cac 
his 


gcc 

ala 


gga tga 
gly)OPA 



SEQ ID NOS:260-261 

FIG. 13E 



SEQ ID NO: 262 

CCA CCG GGG CTG GAG GGG CGA ATG TGC GCC 
( pro p ro gly leu glu gly arg met cys ala 
6l72T"^SEQ ID NOs 2 63 

GCG GCT GAT CCC CTG GCC CGA GAC GGG GCA 
ala ala asp pro leu ala arg asp gly ala 
121/41 

GTA GCG GTT CGG CGG GAA CCG AGC GCC GAC 
val ala val arg arg glu pro ser ala asp 
181/61 

GAC GGG TAT GGC GGC GAC TGA GGT GAT CTG 

asp gly tyr gly gly asp)OPA(gly asp leu 

241/81 SEQ ID NO: 266—*' 

TTA CGA CCA GGG AAT TTC GAA AAT GTT ATT 

leu arg pro gly asn phe glu asn val ile 

301/101 

ACC CCC TAG GTG TAG TGT TTT CGA GTA CCG 
thr pro) AMB val AMB ( cys phe arg val pro 

SEQ ID NO: 267 



31/11 



GAA CGC 


CGT 


CGG 


CCA 


ACT 


TGG 


CCG 


CTG 


AGG 


glu arg 


arg 


arg 


pro 


thr 


trp 


pro 


leu 


arg 






91/31 










AGC CAA 


TAG 


CGG 


CTC 


CAT 


CGG 


GCT 


TTG 


CTG 


ser gln)AMB<arg 


leu 


his 


arg 


ala 


leu 


leu 


151/51 




^-SEQ ID NO: 


2 64 






GTT GTC 


GGT 


GCC 


CGG 


TGA 


TAT 


ATT 


GGG 


TCA 


val val 


gly 


ala 


arg)OPA( tyr 


ile 


gly 


ser 


211/71 










SEQ 


ID NO: 


265 


CGA CAC 


GCC 


GCC 


GCG 


GTG 


CTC 


GAG 


CCA 


GGC 


arg his 


ala 


ala 


ala 


val 


leu 


glu 


pro 


gly 


271/91 


















CAG AAC 


ATC 


TTG 


TAT 


CTC 


TTC 


CTC 


CGT 


GCC 


gin asn 


ile 


leu 


tyr 


leu 


phe 


leu 


arg 


ala 


331/111 


















GCA GAT 


CCC 


AGT 


TCA 


CCA 


GTC 


TCA 


CCA 


GAT 


ala asp 


pro 


ser 


ser 


pro 


val 


ser 


pro 


asp] 



SEQ XD NOS: 262-267 

FIG. 14A 



^SEQ ID NO: 268 

CAC CGG GGC TGG AGG GGC GAA TGT GCG 
(h is arg gly trp arg gly glu cys ala 
62/21 ^-SEQ ID NO: 269 
CGG CTG ATC CCC TGG CCC GAG ACG GGG 
arg leu ile pro trp pro glu thr gly 
122/41 

TAG CGG TTC GGC GGG AAC CGA GCG CCG 

AMB ( arg phe gly gly asn arg ala pro 

182/61 ^SEQ ID NO: 271 

ACG GGT ATG GCG GCG ACT GAG GTG ATC 

thr gly met ala ala thr glu val ile 

242/81 

TAC GAC CAG GGA ATT TCG AAA ATG TTA 
tyr asp gin gly ile ser lys met leu 
302/101 

CCC CCT AGG TGT AGT GTT TTC GAG TAC 
pro pro arg cys ser val phe glu tyr 



32/11 

CCG AAC GCC GTC GGC CAA CTT GGC CGC TGA GGG 
pro asn ala val gly gin leu gly arg)OPA(gly 

92/31 SEQ ID NO: 270^ 

CAA GCC AAT AGC GGC TCC ATC GGG CTT TGC TGG 

gin ala asn ser gly ser ile gly leu cys trp) 

152/51 

ACG TTG TCG GTG CCC GGT GAT ATA TTG GGT CAG 

thr leu ser val pro gly asp ile leu gly gin 
212/71 

TGC GAC ACG CCG CCG CGG TGC TCG AGC CAG GCT 

cys asp thr pro pro arg cys ser ser gin ala 
272/91 

TTC AGA ACA TCT TGT ATC TCT TCC TCC GTG CCA 

phe arg thr ser cys ile ser ser ser val pro 
332/111 

CGG CAG ATC CCA GTT CAC CAG TCT CAC CAG ATC 

arg gin ile pro val his gin ser his gin ile) 



SEQ XD NOS: 268-271 



FIG. 14B 



p — SEQ ID NO: 274 

ACC GGG GCT GGA GGG GCG AAT GTG CGC CGA 
( ^hr g ly ala gly gly ala asn val arg arg 
63/21^ — SEQ ID NO: 275 

GGC TGA TCC CCT GGC CCG AGA CGG GGC AAG 
gly)OPA(ser pro gly pro arg arg gly lys 
123/41 ^ SEQ ID NO: 276 

AGC GGT TCG GCG GGA ACC GAG CGC CGA CGT 
ser gly ser ala gly thr glu arg arg arg 
183/61 

CGG GTA TGG CGG CGA CTG AGG TGA TCT GCG 
arg val trp arg arg leu arg)OPA<ser ala 

243/81 SEQ ID NO: 277 ' 

ACG ACC AGG GAA TTT CGA AAA TGT TAT TCA 
thr thr arg glu phe arg lys cys tyr ser 
303/101 

CCC CTA GGT GTA GTG TTT TCG AGT ACC GGC 
pro leu gly val val phe ser ser thr gly 



33/11 

ACG CCG TCG GCC AAC TTG GCC GCT GAG GGC 
thr pro ser ala asn leu ala ala glu gly 
9 3/31 

CCA ATA GCG GCT CCA TCG GGC TTT GCT GGT 
pro ile ala ala pro ser gly phe ala gly 
153/51 

TGT CGG TGC CCG GTG ATA TAT TGG GTC AGA 
cys arg cys pro val ile tyr trp val arg 
213/71 

ACA CGC CGC CGC GGT GCT CGA GCC AGG CTT 
thr arg arg arg gly ala arg ala arg leu 
273/91 

GAA CAT CTT GTA TCT CTT CCT CCG TGC CAC 
glu his leu val ser leu pro pro cys his 
333/111 

AGA TCC CAG TTC ACC AGT CTC ACC AGA TC 
arg ser gin phe thr ser leu thr arg) 



SEQ ID NOS: 274-277 

FIG. 14C 



part of the nucleotide sequence of seq!4A 
1^1 ^ SEQ ID NO: 278 31/11 

TTT TCG AGT ACC GGC AGA TCC CAG GTT CAC CAG GTC TCA CCA GAT C 
phe ser ser thr gly arg ser gin val his gin val ser pro asp 
^ SEQ ID NO: 279 

SEQ ID NOS : 278-279 

FIG. 14A" 



1/1 ^ SEQ ID NO: 280 31/11 

TGT TTT CGA GTA CCG GCA GAT CCC AGG TTC ACC AGG TCT CAC CAG ATC 
cys phe arg val pro ala asp pro arg phe thr arg ser his gin ile 
V SEQ ID NO: 281 

SEQ ID NOS: 280-281 

FIG. 14C 



l^J^^SEQ ID NO: 282 31/11 

GTT TTC GAG TAC CGG CAG ATC CCA GGT TCA CCA GGT CTC ACC AGA TC 
val phe glu tyr arg gin ile pro gly ser pro gly leu thr arg 
^ -SEQ ID NO: 283 

SEQ ZD NOS: 282-283 



FIG. 14C 



MAR 2 6 2003 Si 



ORF predicted based on the sequence 
(Nature 393:537-544) and containing seql4A' 



published by Cole et al . 



l£i_^-SEQ ID NO: 284 31/11 

TAG CGG TTC GGC GGG AAG CTA GCG GCG ACG TTG TCG GTG GCC 

AMB(arg phe gly gly lys leu ala ala thr leu ser val ala 

61/21*" SEQ ID NO: 285 91/31 

ACG GGT ATG GCG GCG GCT GAG GTG ATC TGC GAC ACG CCG CCG 

thr gly met ala ala ala glu val ile cys asp thr pro pro 

121/41 151/51 

TAC GAC CAG GGA ATT TCG AAA ATG TTA TTC AGA ACA TCT TGT 

tyr asp gin gly ile ser lys met leu phe arg thr ser cys 

181/61 211/71 

CCC CTA GGT GTA GTG TTT TCG AGT ACC GGC AGA TCC CAG GTT 

pro leu gly val val phe ser ser thr gly arg ser gin val 

241/81 271/91 

cca egg ggc gcg atg aac ttc ccg gca teg gca teg cca ggt 

pro arg gly ala met asn phe pro ala ser ala ser pro gly 

301/101 331/111 

tat gac ggg aat ctg gag cct tgt egg gec get caa cat ate 

tyr asp gly asn leu glu pro cys arg ala ala gin his ile 

361/121 391/131 

teg. ttg cca gat cct gtc aga ttc ccg att tec gca aag gag 

ser leu pro asp pro val arg phe pro ile ser ala lys glu 

421/141 

gac cgt tta cac taa 
asp arg leu his)OCH 



GGT GAT ATA 
gly asp ile 

CGG TGC TCG 
arg cys ser 

ATC TCT TCT 
ile ser ser 

CAC CAG GTC 
his gin val 

cga egg acg 
arg arg thr 

gaa gat gca 
glu asp ala 

egg tac gee 
arg tyr ala 



TTG GGT CAG 
leu gly gin 

AGC CAG GCT 
ser gin ala 

CCG TGC CAC 
pro cys his 

TCA CCA gat 
ser pro asp 

tgg teg cgc 
trp ser arg 

eta ctt gag 
leu leu glu 

cat gac cgt 
his asp arg 



SEQ ZD NOS:284-285 



FIG. 14D 




Sequence Rv3054c predicted by Cole et al . {Nature 393:537-544) 
which may be in phase with Seql4A* 



1/1 _^-SEQ ID NO: 286 










31/11 


















gtg tea 


gat 


acc 


aag 


tec 


gac 


ate 


aaa 


ate 


ttg gee 


tta 


gtg 


gga 


age 


ctg 


cgc 


gcg 


gcg 


(val ser 


asp 


thr 


lys 


ser 


asp 


ile 


lys 


ile 


leu ala 


leu 


val 


gly 


ser 


leu 


arg 


ala 


ala 


61/21 ^ 


— SEQ ID 


NO: 


287 








91/31 


















teg ttc 


aac 


cgc 


cag 


ate 


gee 


gag 


ctg 


get 


gee aag 


gtc 


get 


ccg 


gac 


ggc 


gtc 


acc 


gtc 


ser phe 


asn 


arg 


gin 


ile 


ala 


glu 


leu 


ala 


ala lys 


val 


ala 


pro 


asp 


gly 


val 


thr 


val 


121/41 


















151/51 


















acc atg 


ttc 


gag 


ggg 


ctg 


ggg 


gac 


ctg 


ccg 


ttc tac 


aac 


gaa 


gac 


ate 


gac 


aca 


gcg 


acg 


thr met 


phe 


glu 


gly 


leu 


gly 


asp 


leu 


pro 


phe tyr 


asn 


glu 


asp 


ile 


asp 


thr 


ala 


thr 


181/61 


















211/71 


















gag gtg 


ccg 


gcg 


ccg 


gtg 


age 


gcg 


ttg 


egg 


gag gee 


gcg 


tct 


gac 


gcg 


cac 


get 


gee 


ttg 


glu val 


pro 


ala 


pro 


val 


ser 


ala 


leu 


arg 


glu ala 


ala 


ser 


asp 


ala 


his 


ala 


ala 


leu 


241/81 


















271/91 


















gtg gtc 


acg 


ccg 


gaa 


tac 


aac 


ggc 


age 


att 


ccg gee 


gtg 


ate 


aag 


aac 


gcg 


ate 


gac 


tgg 


val val 


thr 


pro 


glu 


tyr 


asn 


gly 


ser 


ile 


pro ala 


val 


ile 


lys 


asn 


ala 


ile 


asp 


trp 


301/101 


















331/111 


















ctg tec 


agg 


cca 


ttc 


ggc 


gat 


ggc 


gcg 


ttg 


aag gac 


aag 


ccg 


ttg 


gee 


gtg 


ate 


ggc 


ggc 


leu ser 


arg 


pro 


phe 


gly 


asp 


gly 


ala 


leu 


lys asp 


lys 


pro 


leu 


ala 


val 


ile 


gly 


gly 


361/121 


















391/131 


















tec atg 


ggc 


cgc 


tac 


ggc 


ggg 


gta 


tgg 


gcg 


cac gac 


gag 


act 


cgc 


aag 


teg 


ttc 


age 


ate 


ser met 


gly 


arg 


tyr 


gly 


gly 


val 


trp 


ala 


his asp 


glu 


thr 


arg 


lys 


ser 


phe 


ser 


ile 


421/141 


















451/151 


















get ggc 


acg 


egg 


gtg 


gtc 


gat 


gcg 


ate 


aaa 


ctg teg 


gtg 


ccg 


ttc 


caa 


act 


ctg 


ggc 


aag 


ala gly 


thr 


arg 


val 


val 


asp 


ala 


ile 


lys 


leu ser 


val 


pro 


phe 


gin 


thr 


leu 


gly 


lys 


481/161 


















511/171 


















teg gtc 


gcg 


gac 


gac 


gec 


ggg 


ctg 


gcg 


gcg 


aat gtg 


cgc 


gac 


gee 


gtc 


ggc 


aac 


ttg 


gee 


ser val 


ala 


asp 


asp 


ala 


gly 


leu 


ala 


ala 


asn val 


arg 


asp 


ala 


val 


gly 


asn 


leu 


ala 



541/181 

get gag gtc ggc tga 
ala glu val gly)OPA 



SEQ XD NOS: 286-287 



FIG. 14E 



ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv3054c 
3 V1 ^ SEQ ID NO: 288 31/11 

taa cgc gat egg aat aaa teg gac cat ggt ccg gtt ggc teg tgc aag gac gtg gac caa 
OCH(arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 
61/21^ — SEQ ID NO: 289 91/31 

caa gcg gaa agg aac gta gca gtg tea gat ace aag tec gac ate aaa ate ttg gee tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp ile lys ile leu ala leu 
121/41 151/51 

gtg gga age ctg cgc gcg gcg teg ttc aac cgc cag ate gec gag ctg get gee aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
181/61 211/71 

get ccg gac ggc gtc ace gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
241/81 271/91 

gaa gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gee gcg 
glu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala 
301/101 331/111 

tct gac gcg cac get gee ttg gtg gtc acg ccg gaa tac aac ggc age att ccg gee gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
361/121 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 
421/141 451/151 

ccg ttg gee gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
481/161 511/171 

act cgc aag teg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
541/181 571/191 

ccg ttc caa act ctg ggc aag teg gtc gcg gac gac gee ggg ctg gcg gcg aat gtg cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
601/201 631/211 
gac gee gtc ggc aac ttg gee get gag gtc ggc tga 
asp ala val gly asn leu ala ala glu val gly)OPA 



SEQ ID NOS: 288-289 



FIG. 14F 



MAR 2 6 ma S 

SEQ ID NO: 290: first frame nucleic acid 
SEQ ID NO: 297: second frame nucleic acid 
SEQ ID NO: 310: third frame nucleic acid 

fragment based on the sequence published by Cole et al . (Nature 393:537-544) 
and containing seq 14F' and seq 14P*^.SEQ ID NO: 291 
1/l^SEQ ID NO: 290,297,310 — 31/11 

taa cgc gat egg aat aaa tcg /gac cat ggt ccg gtt ggc teg tgc aag gac gtg gac caa 
OCH(arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 
(a sn ala i le gl y ile asn arg thr met val arg leu ala arg ala arg thr trp thr asn 
((thr arg ser glu) OCH( ile\gly pro trp ser gly trp leu val gin gly arg gly pro thr 
6 1/21^ SEQ ID NO: 298 > \ SEQ ID NO: 311 91/31_ SEQ ID NO: 312 

caa gcg gaa agg aac gta gca gtg tea gat acc aag tec gac ate aaa ate ttg gec tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp ile lys ile leu ala leu 
lys arg lys gly thr)AMB(gln cys gin ile pro ser pro thr ser lys ser trp pro ) AMB 
ser gly lys glu arg ser) ser val arg tyr gin val arg his gin asn leu gly leu ser 
121/41 SEQ ID NO: 299-^ 151/51 

gtg gga age ctg cgc gcg gcg teg ttc aac cgc cag ate gec gag ctg get gec aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
(trp glu ala cys ala arg arg arg ser thr ala arg ser pro ser trp leu pro arg ser 

gly\lys pro ala arg gly val val gin pro pro asp arg arg ala gly cys gin gly arg 
lSl/ei^SEQ ID NO: 300 211/71 

get ccg gac ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
leu arg thr ala ser pro ser pro cys ser arg gly trp gly thr cys arg ser thr thr 
ser gly arg arg his arg his his val arg gly ala gly gly pro ala val leu gin arg 
241/81 271/91 

gaa gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gec gcg 
glu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala 
lys thr ser thr gin arg arg arg cys arg arg arg)OPA(ala arg cys gly arg pro arg 
arg his arg his ser asp gly gly ala gly ala -gly gluTarg val ala gly gly arg val) 
301/101 331/111 V — SEQ ID NO: 301 

tct gac gcg cac get gec ttg gtg gtc acg ccg gaa tac aac ggc age att ccg gec gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
leu thr arg thr leu pro trp trp ser arg arg asn thr thr ala ala phe arg pro OPA 
OPA(arg ala arg cys leu gly gly his ala gly ile gin arg gin his ser gly arg asp 
361/123N-SEQ ID NO: 313 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 
(ser arg thr arg ser thr gly cys pro gly his ser ala met ala arg)OPA(arg thr ser 

^gTr\glu arg asp arg leu ala val gin ala ile arg arg trp arg val glu^gly gin ala 
421/141^-SEQ ID NO: 302 451/151 SEQ ID NO: 303 

ccg ttg gec gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
arg trp pro)OPA{ser ala ala pro trp ala ala thr ala gly tyr gly arg thr thr arg 
val gly arg asp^arg arg leu his gly pro leu arg arg gly met gly ala arg arg asp 
481/161 SEQ ID NO: 304 511/171 

act cgc aag teg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
leu ala ser arg ser ala ser leu ala arg gly trp ser met arg ser asn cys arg cys 
ser gin val val gin his arg trp his ala gly gly arg cys asp gin thr val gly ala 
541/181 571/191 

ccg ttc caa act ctg ggc aag teg gtc gcg gac gac gee ggg ctg gcg gcg aat gtg t cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
arg ser lys leu trp ala ser arg ser arg thr thr pro gly trp arg arg met cys ala 
val pro asn ser gly gin val gly arg gly arg arg arg ala gly gly glu cys ala arg 
601/201 631/211 p SEQ ID NO: 292 

gac gee gtc ggc aac ttg gec get gag gtc ggc tga(tcc ctg ggc cga ggc ggg tea gec 
asp' ala val gly asn leu ala ala glu val gly)OPA<ser leu gly arg gly gly ser ala 
thr pro ser ala thr trp pro leu arg ser ala asp pro" trp ala glu ala gly gin pro 
arg arg arg gin leu gly arg)OPA(gly arg leu ile pro gly pro arg arg val ser gin) 
661/221 SEQ ID NO: 314-* 691/231 SEQ ID NO: 293 

aat age ggc tec ate ggc ttt get ggt age ggt teg gcg gga agc\tag(cgg cga cgt tgt 
asn ser gly ser ile gly phe ala gly ser gly ser ala gly ser)AMB(arg arg arg cys 
ile ala ala pro ser ala leu leu val ala val arg arg glu ala ser gly asp val val 
AMB(arg leu his arg leu cys trp) AMB (arg phe gly gly lys leu ala ala thr leu ser 
^SEQ ID NO: 315 ^ SEQ ID NO: 316 



SEQ ID NOS:290-316 

FIG. 14G 




721/241 751/251 SEQ ID NO: 294 

egg tgg ccg gtg ata tat tgg gtc aga egg gta tgg egg egg ctg agg tga\tct gcg aca 
arg trp pro val ile tyr trp val arg arg val trp arg arg leu arg)OPA(ser ala thr 

gly gly arg)OPA(tyr ile gly ser asp gly tyr gly gly gly)OPA(gly asp leu arg his 
val ala gly/asp ile leu gly gin thr gly met ala ala ala/glu val ile cys asp thr 
781/261 SEQ ID NO: 305 811/271 SEQ ID NO: 306 

cgc cgc cgc ggt get cga gee agg ctt acg acc agg gaa ttt cga aaa tgt tat tea gaa 
arg arg arg gly ala arg ala arg leu thr thr arg glu phe arg lys cys tyr ser glu 

ala ala ala val leu glu pro gly leu arg pro gly asn phe glu asn val ile gin asn 
pro pro arg cys ser ser gin ala tyr asp gin gly ile ser lys met leu phe arg thr 

841/281 871/291 SEQ ID NO: 295 

cat ctt gta tct ctt etc cgt gec acc ccc tag gtg tagftgt ttt cga gta ccg gca gat 
his leu val ser leu. leu arg ala thr pro ) AMB val AMB ( cys phe arg val pro ala asp 

ile leu tyr leu phe ser val pro pro pro arg cys ser val phe glu tyr arg gin ile 
ser cys ile ser ser pro cys his pro leu gly val val phe ser ser thr gly arg ser 

901/301 931/311 ^ SEQ ID NO: 296 

ccc agg ttc acc agg tct cac cag ate cac ggg gcg cga tgafact tec egg cat egg cat 
pro arg phe thr arg ser his gin ile his gly ala arg)OPA(thr ser arg his arg his 

pro gly ser pro gly leu thr arg ser thr gly arg asp glu leu pro gly ile gly ile 
gin val his gin val ser pro asp pro arg gly ala met asn phe pro ala ser ala ser 
961/321 991/331 

cgc cag gtc gac gga cgt ggt cgc get atg acg gga ate tgg age ctt gtc ggg ccg etc 
arg gin val asp gly arg gly arg ala met thr gly ile trp ser leu val gly pro leu 
ala arg ser thr asp val val ala leu)OPA(arg glu ser gly ala leu ser gly arg ser 
pro gly arg arg thr trp ser arg tyr/asp gly- asn leu glu pro cys arg ala ala gin 
1021/341 SEQ ID NO: 307 ^ 1051/351 

aac ata teg aag atg cac tac ttg agt cgt tgc cag ate ctg tea gat tec cga ttt ccg 
asn ile ser lys met his tyr leu ser arg cys gin ile leu ser asp ser arg phe pro 
thr tyr arg arg cys thr thr)OPA(val val ala arg ser cys gin ile pro asp phe arg 
his ile glu asp ala leu leu/gTu ser leu pro asp pro val arg phe pro ile ser ala 
1081/361 SEQ ID NO: 308 1111/371 

caa agg age ggt acg ccc atg acc gtg acc gtt tac act aa) 
gin arg ser gly thr pro met thr val thr val tyr thr) 
lys gly ala val arg pro)OPA pro OPA(pro phe thr leu 
lys glu arg tyr ala his asp arg asp} arg leu his)OCH 
SEQ ID NO: 309— 



SEQ ID NOS:290-316 

FIG. 14G(continued) 



1/1^- SEQ ID NO: 317 31/11 



CAA GCC 


CGG 


CCG 


CGA 


CTG 


TTT 


GCC 


GTT 


TTG 


GGG CTC 


CTA 


CCA 


GAA 


CAC 


CAC 


CTG 


GCG 


GCC 


(gin ala 


arg 


pro 


arg 


leu 


phe 


ala 


val 


leu 


gly leu 


leu 


pro 


glu 


his 


his 


leu 


ala 


ala 


61/2 l" — - 


— SEEQ ID NO: 


318 








91/31 


















GCG CAC 


CAT 


GGT 


GTG 


CAC 


CAG 


TTG 


CGA 


TCG 


GTT CCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


CGA 


CGT 


ala his 


his 


gly 


val 


his 


gin 


leu 


arg 


ser 


val pro 


pro 


ala 


arg 


gly 


arg 


arg 


arg 


arg 


121/41 


















151/51 


















CGA TGC 


CCG 


CGC 


CCC 


GGC 


GGC 


GCA 


GCT 


GCG 


TAG CTC 


GAC 


CCG 


GTC 


GAC 


GAC 


GAC 


GGG 


GTC 


arg cys 


pro 


arg 


pro 


gly 


gly 


ala 


ala 


ala) AMB (leu 


asp 


pro 


val 


asp 


asp 


asp 


gly 


val 


181/61- 


















211/71^ 


SEQ 


ID NO : 319 










GGC GGA 


CCA 


GTC 


GGC 


GAT 


GTC 


GAG" 


GCG 


ATG 


GCA ATA 


CAG 


CGC 


CTT 


GGT 


GCG 


CGG 


CCA 'CAC " 


gly gly 


pro 


val 


gly 


asp 


val 


glu 


ala 


met 


ala ile 


gin 


arg 


leu 


gly 


ala 


arg 


pro 


his 


241/81 


















271/91 


















GTC TGA 


GGT 


GGC 


GAA 


GAC 


CAG 


TCC 


CGC 


GCC 


CAC CGG 


CAG 


CCG 


GAT 


CCG 


GAT 


ACG 


CGG 


TAC 


val)OPA(gly 


gly 


glu 


asp 


gin 


ser 


arg 


ala 


his arg 


gin 


pro 


asp 


pro 


asp 


thr 


arg 


tyr) 



^— SEQ ID NO: 32 0 



SEQ ID NOS: 317-320 



FIG. 15A 




SEQ ID NO: 321 32/11 



AAG CCC 


GGC 


CGC 


GAC 


TGT 


TTG 


CCG 


TTT 


TGG 


GGC TCC 


TAC 


CAG 


AAC 


ACC 


ACC 


TGG 


can 


CCG 


lyS pro 


gly 


arg 


asp 


cys 


leu 


pro 


phe 




gly ser 




gin 


asn 


thr 


thr 






pro 


62T21 


— SEQ ID NO: 


322 








92 /3 1 


















CGC ACC 


ATG 


GTG 


TGC 


ACC 


AGT 


TGC 


GAT 


CGG 


TTC CTC 


cca 


\J v_ 


rice. 


riric 




nap 






arg thr 


met 


val 


cys 


thr 


ser 


cys 


asp 


arg 


phe leu 


pro 


arg 


ala 


gly 


gly 


asp 


asp 


val 


122/41 


















152/51 


















GAT GCC 


eric 


rice* 


VJ7 






CAG 


CTG 


CGT 


AGC TCG 


ACC 


CGG 


TCG 


ACG 


ACG 


ACG 


GGG 


TCG 


asp ala 


arg 


ala 


pro 


ala 


ala 


gin 


leu 


arg 


ser ser 


thr 


arg 


ser 


thr 


thr 


thr 


gly 


ser 


182/61 


















212/71 




















CAG 


TCG 


GCG 


ATG 


TCG 


AGG 


CGA 


TGG 


CAA TAC 


AGC 


GCC 


TTG 


GTG 


CGC 


GGC 


CAC 


ACG 


ala asp 


gin 


ser 


ala 


met 


ser 


arg 


arg 


trp 


gin tyr 


ser 


ala 


leu 


val 


arg 


gly 


his 


thr 


242/81 


















272/91 


















TCT GAG 


GTG 


GCG 


AAG 


ACC 


AGT 


CCC 


GCG 


CCC 


ACC GGC 


AGC 


CGG 


ATC 


CGG 


ATA 


CGC 


GGT 


AC 


ser glu 


val 


ala 


lys 


thr 


ser 


pro 


ala 


pro 


thr gly 


ser 


arg 


ile 


arg 


ile 


arg 


gly) 
















SEQ ZD NOS:321- 


322 






























FIG.15B 


- 
















. SEQ 


ID NO: 323 












33/11 


















AGO COG 


GCC 


GCG 


ACT 


GTT 


TGC 


CGT 


TTT 


GGG 


GCT CCT 


ACC 


AGA 


ACA 


CCA 


PPT 




GGC 


CGC 


ser pro 


ala 


ala 


thr 


val 


cys 


arg 


phe 


gly 


ala pro 


thr 


arg 


thr 


pro 


pro 


alv 


crlv 


arg 


63/21"' 


-SEQ 


ID NO: 324 










93/31 


















GCA CCA 


TGG 


TGT 


GCA 


CCA 


GTT 


GCG 


ATC 


GGT 


TCC TCC 


CGC 


GCG 


CGG 


GCG 


GCG 


ACG 


ACG 


TCG 


ala pro 


trp 


cys 


ala 


pro 


val 


ala 


ile 


gly 


ser ser 


arg 


ala 


ara 

-»- 3 


ala 


ala 


thr 


thr 


ser 


123/41 


















153/51 


















ATG CCC 


GCG 


CCC 


CGG 


CGG 


CGC 


AGC 


TGC 


GTA 


GCT CGA 


CCC 


GGT 


CGA 


CGA 


CGA 


CGG 


GGT 


CGG 


met pro 


ala 


pro 


arg 


arg 


arg 


ser 


cys 


val 


ala arg 


pro 


gly 


arg 


arg 


arg 


arg 


gly 


arg 


183/61 


















213/71 


















CGG ACC 


AGT 


CGG 


CGA 


TGT 


CGA 


GGC 


GAT 


GGC 


AAT ACA 


GCG 


CCT 


TGG 


TGC 


GCG 


GCC 


ACA 


CGT 


arg thr 


ser 


arg 


arg 


cys 


arg 


gly 


asp 


gly 


asn thr 


ala 


pro 


trp 


cys 


ala 


ala 


thr 


arg 


243/81 


















273/91 


















CTG AGG 


TGG 


CGA 


AGA 


CCA 


GTC 


CCG 


CGC 


CCA 


CCG GCA 


GCC 


GGA 


TCC 


GGA 


TAC 


GCG 


GTA 


C 


leu arg 


trp 


arg 


arg 


pro 


val 


pro 


arg 


pro 


pro ala 


ala 


gly 


ser 


gly 


tyr 


ala 


val) 



SEQ ID NOS: 323-324 



FIG.15C 



MAR 2 6 2003 SI 

part of the nucleotide sequence of seq!5A 



1/1^- SEQ ID NO s 325 31/11 



GGC GGC 


CGC 


GCG 


CCA 


TGG 


TGT 


GCA 


CCA 


GTT 


GCG ATC 


GGT 


TCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


glygly 


arg 


ala 


pro 


trp 


cys 


ala 


pro 


val 


ala ile 


gly 


ser 


pro 


ala 


arg 


gly 


arg 


arg 


61/21"*^- 


SEQ 


ID NO: 326 










91/31 


















CGA CGT 


CGA 


TGG 


CCG 


CGC 


ccc 


GGC 


GGC 


TGC 


AGC TGC 


GTA 


GCT 


CGA 


CCC 


GGT 


CGA 


CGA 


CGA 


arg arg 


arg 


trp 


pro 


arg 


pro 


gly 


gly 


cys 


ser cys 


val 


ala 


arg 


pro 


gly 


arg 


arg 


arg 


121/41 


















151/51 


















CGG GGT 


CGG 


CGG 


GCC 


AGT 


CGG 


CGA 


TGT 


CGA 


GGC GAT 


GGC 


AAT 


ACA 


GCG 


CCT 


TGG 


TGC 


GCG 


arg gly 


arg 


arg 


ala 


ser 


arg 


arg 


cys 


arg 


gly asp 


gly 


asn 


thr 


ala 


pro 


trp 


cys 


ala 


181/61 


















211/71 


















GCC AC A 


CGT 


CTG 


AGG 


TGG 


CGA 


AGA 


CCA 


GTC 


CCG CGC 


CCA 


CCG 


GCA 


GCC 


GGA 


TC 






ala thr 


arg 


leu 


arg 


trp 


arg 


arg 


pro 


val 


pro arg 


pro 


pro 


ala 


ala 


gly 









SEQ ID NOS: 325-326 

FIG.15A" 



1/1, SEQ ID 


NO: 


327 












31/11 


















GCG GCC 


GCG 


CGC 


CAT 


GGT 


GTG 


CAC 


CAG 


TTG 


CGA TCG 


GTT 


CTC 


CCG 


CGC 


GCG 


GGC 


GGC 


GAC 


[ala ala 


ala 


arg 


his 


gly 


val 


his 


gin 


leu 


arg ser 


val 


leu 


pro 


arg 


ala 


gly 


gly 


asp 


61/21^- 


SEQ 


ID NO: 328 










91/31 


















GAC GTC 


GAT 


GGC 


CGC 


GCC 


CCG 


GCG 


GCT 


GCA 


GCT GCG 


TAG 


CTC 


GAC 


CCG 


GTC 


GAC 


GAC 


GAC 


asp val 


asp 


gly 


arg 


ala 


pro 


ala 


ala 


ala 


ala ala)AMB(leu 


asp 


pro 


val 


asp 


asp 


asp 


121/41 


















151/51 




SEQ ID NO: 


329 






GGG GTC 


GGC 


GGG 


CCA 


GTC 


GGC 


GAT 


GTC 


GAG 


GCG ATG 


GCA 


ATA 


CAG 


CGC 


CTT 


GGT 


GCG 


CGG 


gly val 


gly 


gly 


pro 


val 


gly 


asp 


val 


glu 


ala met 


ala 


ile 


gin 


arg 


leu 


gly 


ala 


arg 


181/61 


















211/71 


















CCA CAC 


GTC 


TGA 


GGT 


GGC 


GAA 


GAC 


CAG 


TCC 


CGC GCC 


CAC 


CGG 


CAG 


CCG 


GAT 


C 






pro his 


val)OPA(gly 


gly 


glu 


asp 


gin 


ser 


arg ala 


his 


arg 


gin 


pro 


asp) 
















— SEQ ID 


NO: 


330 























SEQ ID NOS: 327-330 



FIG. 15B" 



1/1, SEQ ID 


NO: 


331 












31/11 


















TGG CGG 


CCG 


CGC 


GCC 


ATG 


GTG 


TGC 


ACC 


AGT 


TGC GAT 


CGG 


TTC 


TCC 


CGC 


GCG 


CGG 


GCG 


GCG 


trp arg 


pro 


arg 


ala 


met 


val 


cys 


thr 


ser 


cys asp 


arg 


phe 


ser 


arg 


ala 


arg 


ala 


ala 


61/21^- 


SEQ ID NO: 


332 








91/31 


















ACG ACG 


TCG 


ATG 


GCC 


GCG 


CCC 


CGG 


CGG 


CTG 


CAG CTG 


CGT 


AGC 


TCG 


ACC 


CGG 


TCG 


ACG 


ACG 


thr thr 


ser 


met 


ala 


ala 


pro 


arg 


arg 


leu 


gin leu 


arg 


ser 


ser 


thr 


arg 


ser 


thr 


thr 


121/41 


















151/51 


















ACG GGG 


TCG 


GCG 


GGC 


CAG 


TCG 


GCG 


ATG 


TCG 


AGG CGA 


TGG 


CAA 


TAC 


AGC 


GCC 


TTG 


GTG 


CGC 


thr gly 


ser 


ala 


gly 


gin 


ser 


ala 


met 


ser 


arg arg 


trp 


gin 


tyr 


ser 


ala 


leu 


val 


arg 


181/61 


















211/71 


















GGC CAC 


ACG 


TCT 


GAG 


GTG 


GCG 


AAG 


ACC 


AGT 


CCC GCG 


CCC 


ACC 


GGC 


AGC 


CGG 


ATC 






gly his 


thr 


ser 


glu 


val 


ala 


lys 


thr 


ser 


pro ala 


pro 


thr 


gly 


ser 


arg 


ile 







SEQ ID NOS: 331-332 

FIG. 15C 



MAR 2 6 2003 51 

ORF containing SeqlSA' according to Cole et al . (Nature 393:537-544) 
1,/1_^-SEQ ID NO: 333 31/11 

taa ggt ccg cca acg ctt tac get cga egg ceg cca cga gtt ggc egg cca ctt tea ggc 
OCH(gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser gly 
61/2l"* v -SEQ ID NO: 334 91/31 

cgt agt cgc cgc agg gca ggg ctt ccc gcg teg tct teg egg gtt tgt egg caa agg tgt 
arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg cys 
121/41 151/51 

agg ggt age gtt cgt ggg cgt cga cga cga tgt gca get egg gga tgc egg egg cgc ggg 
arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg gly 
181/61 211/71 

egg tgg ggg tgc gca cgc ccg gee gcg act gtt tgc gcg ttt tgg ggc tct gee aga aca 
arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg thr 
241/81 271/91 

cca cct ggc ggc cgc gcg cca tgg tgt gca cca gtt gcg ate ggt tct ccc gcg cgc ggg 
pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg gly 
301/101 331/111 

egg cga cga cgt cga tgg ccg cgc ccc ggc ggc tgc age tgc gta get cga ccc ggt cga 
arg arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly arg 
361/121 391/131 

cga cga egg ggt egg egg gee agt egg cga tgt cga ggc gat ggc aat aca gcg cct tgg 
arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro trp 
421/141 451/151 

tgc gcg gec aca cgt ctg agg tgg cga aga cca gtc ccg cgc cca ccg gca gee gga tea 
cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly ser 
481/161 511/171 

ggt agg gca ggc gcg agt ctt cag egg ggt tgg egg cga cga gca get cca cag agt gtg 
gly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser val 
541/181 571/191 

agQ gta egg gcg gcg tac ggc aac ggt gaa gca ggc act ccg acg aac cca teg tea cgt 
arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser arg 
601/201 

cga agg ggc agg tga 
arg arg gly arg) OPA 



SEQ XD NOS: 333-334 



FIG. 15D 



R:Rv2530c predicted according to Cole et al . (Nature 393:537-544) and which 
may be in phase with SEQ15A 



1/1 ^SEQ ID 


NO: 


335 












31/11 


















crtcr aca 


gca 


ctg 


etc 


gat 


y v — 


aa t 


y •-y 


c tg 


ex. yLy 


L- y 


ggc 


tgg 


ccg 


CtCl L_ 


eac 


y lu 


cac 


[val__thr 


ala 


leu 


leu 


asp 


val 


asn 


va 1 


1 eu 


lie: CL 1CL 




gly 


trp 


pro 


asn 


his 


v d i 


his 


61/21 


^SEQ ID 


HO: 


336 














91/31 












f =» f- CTCCT 


y 


gcg 


cag 


c ga 


*-yy 


t \- c* 

U- L. 


acg 


cag 


ttc tee 


teg 


aaL 


ggg 


tgg 


gee 


acc 


acg 


ccg 


his ala 


ala 


ala 


gin 


arg 




pile 


4-hr 


g In 


T^V^ A C? 

l^lltei oc=l 


ser 


asn 


gly 


trp 


aid 


enr 


enr 


pro 


121/41 


















151/51 


















ate acc 


gag 


gca 


yyy 


tat 


y 




^ t- t- 




age aat 


cgc 


agt 


gtg 


acg 


cag 


gtg 


teg 


acc 


ile thr 


glu 


ala 


y J-y 




val 


dx y 


lie 


ser 


oc=X doll 


arg 


ser 


val 


met 


gin 


Val 


ser 


1- Vi v 

enr 


181/61 


















^ J — L / / I 


















acg ccg 


get 


ate 




ate 


gc t 


cag 


t~ t"cr 


y <-y 


y>— -y ct v_ y 


act 


tct 


ctt 


gec 


ggg 


cac 


acg 


ttt 


thr pro 


ala 


ile 


ala 


ile 


ala 


gin 


leu 


ala 


ala met 


thr 


ser 


leu 


ala 


gly 


his 


thr 


phe 


241/81 


















271/91 


















tgg cct 


gac 


gat 


gtg 


cca 


ctg 


ate 


gtt 


ggg 


age gec 


ggc 


gat 


cgc 


gat 


gcg 


gtg 


tec 


aac 


trp pro 


asp 


asp 


val 


pro 


leu 


ile 


val 


gly 


ser ala 


gly 


asp 


arg 


asp 


ala 


val 


ser 


asn 


301/101 


















331/111 


















cac cgt 


egg 


gtc 


acc 


gac 


tgc 


cat 


etc 


ate 


gee ttg 


gee 


gcg 


cgc 


tac 


ggg 


ggc 


egg 


ttg 


his arg 


arg 


val 


thr 


asp 


cys 


his 


leu 


ile 


ala leu 


ala 


ala 


arg 


tyr 


gly 


gly 


arg 


leu 


361/121 


















391/131 


















gtc aca 


ttc 


gat 


gec 


gca 


ctg 


gec 


gat 


tea 


gca tec 


gca 


ggc 


etc 


gtc 


gag 


gtg 


ttg 


tag 


val thr 


phe 


asp 


ala 


ala 


leu 


ala 


asp 


ser 


ala ser 


ala 


gly 


leu 


val 


glu 


val 


leu) AMB 



SEQ ID NOS:335-336 

FIG. 15E 



SeqlSP: 


ORF 


according to Cole et 


. al , 


. (Nature 393 


>:537 


-544) containing Rv2530c 


1/1 ^ 


SEQ 


ID NO: 337 










31/11 


















tga tgt 


tec 


gee 


gga 


tgc 


gee 


gac 


ggt 


gac 


ttc cga 


gga 


tgt 


cgt 


ccg 


cgc 


get 


cga 


gga 


OPA(cys 


ser 


ala 


gly 


cys 


ala 


asp 


gly 


asp 


phe arg 


gly 


cys 


arg 


pro 


arg 


ala 


arg 


gly 


61/21>-SEQ ID NO: 338 










91/31 


















cga cgt 


gtg 


acg 


gca 


ctg 


etc 


gat 


gtc 


aat 


gtg ctg 


ate 


gcg 


ctg 


ggc 


tgg 


ccg 


aat 


cac 


arg arg 


val 


thr 


ala 


leu 


leu 


asp 


val 


asn 


val leu 


ile 


ala 


leu 


gly 


trp 


pro 


asn 


his 


121/41 


















151/51 


















gtt cac 


cat 


gcg 


gee 


gcg 


cag 


cga 


tgg 


ttc 


acg cag 


ttc 


tec 


teg 


aat 


ggg 


tgg 


gee 


acc 


val his 


his 


ala 


ala 


ala 


gin 


arg 


trp 


phe 


thr gin 


phe 


ser 


ser 


asn 


gly 


trp 


ala 


thr 


181/61 


















211/71 


















acg ccg 


ate 


acc 


gag 


gca 


ggg 


tat 


gtc 


cga 


att tea 


age 


aat 


cgc 


agt 


gtg 


atg 


cag 


gtg 


thr pro 


ile 


thr 


glu 


ala 


gly 


tyr 


val 


arg 


ile ser 


ser 


asn 


arg 


ser 


val 


met 


gin 


val 


241/81 


















271/91 


















teg acc 


acg 


ccg 


get 


ate 


gcg 


ate 


get 


cag 


ttg gcg 


gcg 


atg 


act 


tct 


ctt 


gee 


ggg 


cac 


ser thr 


thr 


pro 


ala 


ile 


ala 


ile 


ala 


gin 


leu ala 


ala 


met 


thr 


ser 


leu 


ala 


gly 


his 


301/101 


















331/111 


















acg ttt 


tgg 


cct 


gac 


gat 


gtg 


cca 


ctg 


ate 


gtt ggg 


age 


gee 


ggc 


gat 


cgc 


gat 


gcg 


gtg 


thr. phe 


trp 


pro 


asp 


asp 


val 


pro 


leu 


ile 


val gly 


ser 


ala 


gly 


asp 


arg 


asp 


ala 


val 


361/121 


















391/131 


















tec aac 


cac 


cgt 


egg 


gtc 


acc 


gac 


tgc 


cat 


etc ate 


gee 


ttg 


gee 


gcg 


cgc 


tac 


ggg 


ggc 


ser asn 


his 


arg 


arg 


val 


thr 


asp 


cys 


his 


leu ile 


ala 


leu 


ala 


ala 


arg 


tyr 


gly 


gly 


421/141 


















451/151 


















egg ttg 


gtc 


aca 


ttc 


gat 


gee 


gca 


ctg 


gee 


gat tea 


gca 


tec 


gca 


ggc 


etc 


gtc 


gag 


gtg 


arg leu 


val 


thr 


phe 


asp 


ala 


ala 


leu 


ala 


asp ser 


ala 


ser 


ala 


gly 


leu 


val 


glu 


val 



481/161 
ttg tag 
leu) AMB 



SEQ ID NOS: 337-338 

FIG. 15F 



/ 

SEQ ZD NO: 339: first frame nucleic acid 
SEQ ID NO: 346: second frame nucleic acid 
SEQ ID NO: 347: third frame nucleic acid 

Fragment containing SeqlSP' and Seq 15F' 
1/1_^SEQ ID NO: 339,346,347 31/11 

tga tgt tec gec gga tgc gec gac ggt gac ttc cga gga tgt cgt ccg cgc get cga gga 
OPA(cys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 
aspwal pro pro asp ala pro thr val thr ser glu asp val val arg ala leu glu asp 
Met\phe arg arg met arg arg arg)OPA(leu pro arg met ser ser ala arg ser arg thr 
61/2T^ ^SEQ ID NO: 340 ^ SEQ ID NO: 348 *\ 91 / 31 ^. SEQ ID NO: 349 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
asp val OPA arg his cys ser met ser met cys OPA ser arg trp ala gly arg ile thr 
thr cys asp gly thr ala arg cys gin cys ala asp arg ala gly leu ala glu ser arg 
121/41 151/51 

gtt cac cat gcg gec gcg cag cga tgg ttc acg cag ttc tec teg aat ggg tgg gec acc 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
phe thr met arg pro arg ser asp gly ser arg ser ser pro arg met gly gly pro pro 
ser pro cys gly arg ala ala met val his ala val leu leu glu trp val gly his his 
181/61 211/71 

acg ccg ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
arg arg ser pro arg gin gly met ser glu phe gin ala ile ala val OPA cys arg cys 
ala asp his arg gly arg val cys pro asn phe lys gin ser gin cys asp ala gly val 
241/81 271/91 

teg acc acg ccg get ate gcg ate get cag ttg gcg gcg atg act tct ctt gec ggg cac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
arg pro arg arg leu ser arg ser leu ser trp arg arg OPA leu leu leu pro gly thr 
asp his ala gly tyr arg asp arg ser val gly gly asp asp phe ser cys arg ala his 
301/101 331/111 

acg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gee ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
arg phe gly leu thr met cys his OPA ser leu gly ala pro ala ile ala met arg cys 
val leu ala)OPA<arg cys ala thr asp arg trp glu arg arg arg ser arg cys gly val 
361/121 ^SEQ ID NO: 350 391/131 

tec aac cac cgt egg gtc acc gac tgc cat etc ate gee ttg gee gcg cgc tac ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
pro thr thr val gly ser pro thr ala ile ser ser pro trp pro arg ala thr gly ala 
gin pro pro ser gly his arg leu pro ser his arg leu gly arg ala leu arg gly pro 
421/141 451/151 

egg ttg gtc aca ttc gat gec gca ctg gee gat tea gca tec gca ggc etc gtc gag gtg 
arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 
gly trp ser his ser met pro his trp pro ile gin his pro gin ala ser ser arg cys 
val gly his ile arg cys arg thr gly arg phe ser ile arg arg pro arg arg gly val 
481/161 SEQ ID NO: 341 511/171 

ttg tag tcaccg/ggg atg ggc ggc teg cca ggc ctg cag gat ctg egg gcg cag gcg ccc 
leu)AMB(ser pro gly met gly gly ser pro gly leu gin asp leu arg ala gin ala pro 
cys ser his arg gly trp ala ala arg gin ala cys arg ile cys gly arg arg arg pro 
val val thr gly asp gly arg leu ala arg pro ala gly ser ala gly ala gly ala pro 
541/181 571/191 

ccg gtc gga cac egg" cag gec gac get ttt ggc cca cgc gcg cag etc ggc get get ggg 
pro val gly his arg gin ala asp ala phe gly pro arg ala gin leu gly ala ala gly 
arg ser asp thr gly arg pro thr leu leu ala his ala arg ser ser ala leu leu gly 
gly arg thr pro ala gly arg arg phe trp pro thr arg ala ala arg arg cys trp ala 
601/201 631/211 

etc ggg etc ggc ggc age egg etc gaa aac cgt ggt ggc gtc ggc ate gtc gac gaa cca 
s leu gly leu gly gly ser arg leu glu asn arg gly gly val gly ile val , asp . glu nr,o , ; 
- ser . gly ser ala .ala ala' gly ser lys * thr val "val ' ala* ser ala ' s'^r • ser thr asn gin 
arg ala arg arg' gin pro ala arg lys pro trp' trp arg arg -his arg arg arg thr arg) 



SEQ ID NOS: 339-352 



FIG. 15G 



\P £ 

mi* 1 *® 



661/221 SEQ ID NO: 342 691/231 

ggt gag ggc ggc ggc tagjata gcg gta ggt gta ttc ctg ggc gag ctt gcg ggt ttg gca 
gly glu gly gly gly)AMB(ile ala val gly val phe leu gly glu leu ala gly leu ala 
val arg ala ala ala arg AMB arg AMB val tyr ser trp ala ser leu arg val trp gin 
OPA(gly arg arg leu asp ser gly arg cys ile; pro gly arg ala cys gly phe gly arg 
721/241** SEQ ID NO: 351 751/251 

gaa cac gat egg cac gtt ggg aaa gec gat ctg caa ttc ggc cag ccc ate ggc gat cgc 
glu his asp arg his val gly lys ala asp leu gin phe gly gin pro ile gly asp arg 
asn thr ile gly thr leu gly lys pro ile cys asn ser ala ser pro ser ala ile ala 
thr arg ser ala arg trp glu ser arg ser ala ile arg pro ala his arg arg ser pro 
781/261 811/271 

cgt egg gcg ggc gaa gga gtg cgc gaa gat etc cga gta gcg gtc etc gac cac cac ggc 
arg . arg ala gly glu. gly val arg glu asp leu arg val ala val leu asp his his gly 
val gly arg ala lys glu cys ala lys ile ser glu AMB arg ser ser thr thr thr ala 
ser gly gly arg arg ser ala arg arg ser pro ser ser gly pro arg pro pro arg arg 
841/281 871/291 

ggc ccg tgg cag cgc ggc cag ttc ggt cag ttg gta ttt cag gtt gec gtt cag cac gec 
gly pro trp gin arg gly gin phe gly gin leu val phe gin val ala val gin his ala 
ala arg gly ser ala ala ser ser val ser trp tyr phe arg leu pro phe ser thr pro 
pro val ala ala arg pro val arg ser val gly ile ser gly cys arg ser ala arg gin 
901/301 931/311 

aga agt aag gtc cgc caa cgc ttt acg etc gac ggc cgc cac gag ttg gee ggc cac ttt 
arg ser lys val arg gin arg phe thr leu asp gly arg his glu leu ala gly his phe 
glu val arg ser ala asn ala leu arg ser thr ala ala thr ser trp pro ala thr phe 
lys)OCH(gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser 
961/321 ^ SEQ ID NO. 352 991/331 

cag gec gta gtc gec gca ggg cag ggc ttc ccg cgt cgt ctt cgc ggg ttt gtc ggc aaa 
gin ala val val ala ala gly gin gly phe pro arg arg leu arg gly phe val gly lys 
arg pro AMB ser pro gin gly arg ala ser arg val val phe ala gly leu ser ala lys 
gly arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg 
1021/341 1051/351 

ggt gta ggg gta gcg ttc gtg ggc gtc gac gac gat gtg cag etc ggg gat gec ggc ggc 
gly val gly val ala phe val gly val asp asp asp val gin leu gly asp ala gly gly 
val AMB gly AMB arg ser trp ala ser thr thr met cys ser ser gly met pro ala ala 
cys arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg 
1081/361 1111/371 

gcg ggc ggt ggg ggt gcg cac gec egg ccg cga ctg ttt gcg cgt ttt ggg get ctg cca 
ala gly gly gly gly ala his ala arg pro arg leu phe ala arg phe gly ala leu pro 
arg ala val gly val arg thr pro gly arg asp cys leu arg val leu gly leu cys gin 
gly arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg 
1141/381 1171/391 

gaa cac cac ctg gcg gee gcg cgc cat ggt gtg cac cag ttg cga teg gtt etc ccg cgc 
glu his his leu ala ala ala arg his gly val his gin leu arg ser val leu pro arg 

asn thr thr trp arg pro arg ala met val cys thr ser cys asp arg phe ser arg ala 
thr pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg 
1201/401 1231/411 SEQ ID NO: 343x. 

gcg ggc ggc gac gac gtc gat ggc cgc gee ccg gcg get gca get gcg tag^ctc gac ccg 
ala gly gly asp asp val asp gly arg ala pro ala ala ala ala ala)AMB(leu asp pro 

arg ala ala thr thr ser met ala ala pro arg arg leu gin leu arg ser ser thr arg 
gly arg arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly 
1261/421 1291/431 

gtc gac gac gac ggg gtc ggc ggg cca gtc ggc gat gtc gag gcg atg gca ata cag cgc 
val asp asp asp gly val gly gly pro val gly asp val glu ala met ala ile gin arg 
ser thr thr thr gly ser ala gly gin ser ala met ser arg arg trp gin tyr ser ala 
arg arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro 



SEQ Xt> NOS:339-352 



FIG. 15G (continued 1) 



1321/441 SEQ ID NO: 344^ 1351/451 

ctt ggt gcg egg cca cac gtc tga\ggt ggc gaa gac cag tec cgc gec cac egg cag ccg 
leu gly ala arg pro his val)OPA(gly gly glu asp gin ser arg ala his arg gin pro 
leu val arg gly his thr ser glu val ala lys thr ser pro ala pro thr gly ser arg 
trp cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly 
1381/461 1411/471 

gat cag gta ggg cag gcg cga gtc ttc age ggg gtt ggc ggc gac gag cag etc cac aga 
asp gin val gly gin ala arg val phe ser gly val gly gly asp glu gin leu his arg 
ile arg AMB gly arg arg glu ser ser ala gly leu ala ala thr ser ser ser thr glu 
ser gly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser 
1441/481 ^ — SEQ ID NO: 345 1471/491 

gtg tga^ggg tac ggg egg cgt acg gca acg gtg aag cag gca etc cga cga acc cat cgt 
val)OPA(gly tyr gly arg arg thr ala thr val lys gin ala leu arg arg thr his arg 
cys glu gly thr gly gly val arg gin arg OPA ser arg his ser asp glu pro ile val 
val arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser 
1501/501 

cac gtc gaa ggg gca ggt ga) 
his val glu gly ala gly) 
thr ser lys gly gin val 
arg arg arg gly arg) OPA 



SEQ ID NOS: 339-352 (continued 2) 

FIG. 15G (continued (2) 



— SEQ ID NO: 353 












31/11 


















TGC GCA 


TGC 


CGA 


CCA 


GTG 


TGG 


TTG 


GCC 


GGA 


GTT CGT 


TTG 


TTC 


GCG 


ATT 


GCC 


TCA 


ACG 


ATT 


(cys ala 


cys 


arg 


pro 


val 


trp 


leu 


ala 


gly 


val arg 


leu 


phe 


ala 


ile 


ala 


ser 


thr 


ile 


61/21^" 


— SEQ ID 


NO: 


354 










91/31 


















CGA TAT 


AAC 


CAC 


TCT 


AGT 


CAC 


ATC 


AAC 


CAC 


ACT CGT 


ACC 


ATC 


GAG 


CGT 


GTG 


GGT 


TCA 


TGC 


arg tyr 


asn 


his 


ser 


ser 


his 


ile 


asn 


his 


thr arg 


thr 


ile 


glu 


arg 


val 


gly 


ser 


cys 


121/41 


















151/51 


















CAT GCA 


TTC 


GCG 


ACC 


GCG 


GGA 


GCC 


GGC 


GAA 


CCC GGC 


GCC 


ACA 


CAT 


AAT 


CCA 


GAT 


TGA 


GGA 


his ala 


phe 


ala 


thr 


ala 


gly 


ala 


gly 


glu 


pro gly 


ala 


thr 


his 


asn 


pro 


asp) OPA (gly 


181/61 


















211/71 






SEQ ID NO: 


355 -* 




GAC TTC 


CGT 


GCC 


GAA 


CCG 


ACG 


CCG 


ACG 


CAA 


GCT TTC 


GAC 


AGC 


CAT 


GAG 


CGC 


GGT 


CGC 


CGC 


asp phe 


arg 


ala 


glu 


pro 


thr 


pro 


thr 


gin 


ala phe 


asp 


ser 


his 


glu 


arg 


gly 


arg 


arg 


241/81 


















271/91 


















CCT GGC 


AGT 


TGC 


AAG 


TCC 


TTG 


TGC 


ATA 


TTT 


TCT TGT 


CTA 


CGA 


ATC 


AAC 


CGA 


AAC 


GAC 


CGA 


pro gly 


ser 


cys 


lys 


ser 


leu 


cys 


ile 


phe 


ser cys 


leu 


arg 


ile 


asn 


arg 


asn 


asp 


arg 


301/101 


















331/111 


















GCG GCC 


CGA 


GCA 


CCA 


TGA 


ATT 


CAA 


GCA 


GGC 


GGC GGT 


GTT 


GAC 


CGA 


CCT 


GCC 


CGG 


CGA 


GCT 


ala^ ala 


arg 


ala 


pro)OPA(ile 


gin 


ala 


gly 


gly gly 


val 


asp 


arg 


pro 


ala 


arg 


arg 


ala 


361/12 


SEQ ID 


NO: 


356-^ 








391/131 


















GAT GTC 


CGC 


GCT 


ATC 


GCA 


GGG 


GTT 


GTC 


CCA 


GTT CGG 


GAT 


C 














asp val 


arg 


ala 


ile 


ala 


gly 


val 


val 


pro 


val arg 


asp) 

















SEQ ID NOS: 353-356 



FIG. 16A 



TP*?** 



SEQ ID NO: 357 

GCG CAT GCC GAC CAG TGT 
( ala h is ala asp gin cys 



62/21 



SEQ XD NO: 358 



GAT ATA ACC ACT CTA GTC 
asp ile thr thr leu val 
122/41 

ATG CAT TCG CGA CCG CGG 



ser arg pro arg 



met his 
182/61 

ACT TCC GTG CCG AAC CGA 
thr ser val pro ash arg 
242/81 

CTG GCA GTT GGA AGT CCT 
leu ala val ala ser pro 
302/101 

CGG CCC GAG CAC CAT GAA 
arg pro glu his his glu 
362/121 

ATG TCC GCG CTA TCG CAG 
met ser ala leu ser gin 



GGT 
gly 

ACA 
thr 

GAG 
glu 

CGC 
arg 

TGT 
cys 

TTC 
phe 

GGG 
gly 



TGG 
trp 

TCA 
ser 

CCG 
pro 

CGA 
arg 

GCA 
ala 

AAG 
lys 

TTG 
leu 



CCG 
pro 

ACC 
thr 

GCG 
ala 

CGC 
arg 

TAT 
tyr 

CAG 
gin 

TCC 
ser 



GAG 
glu 

ACA 
thr 

AAC 
asn 

AAG 
lys 

TTT 
phe 

GCG 
ala 

CAG 
gin 



32/11 
TTC GTT 
phe val 
92/31 
CTC GTA 
leu val 
152/51 
CCG GCG 
pro ala 
212/71 
CTT TCG 
leu ser 
272/91 
CTT GTC 
leu val 
332/111 
GCG GTG 
ala val 
392/131 
TTC GGG 
phe gly 



TGT TCG 
cys ser 

CCA TCG 
pro ser 

CCA CAC 
pro his 

ACA GCC 
thr ala 

TAC GAA 
tyr glu 

TTG ACC 
leu thr 

ATC 
ile) 



CGA TTG CCT CAA CGA TTC 
arg leu pro gin arg phe 

AGC GTG TGG GTT CAT GCC 
ser val trp val his ala 

ATA ATC CAG ATT GAG GAG 
ile ile gin ile glu glu 

ATG AGC GCG GTC GCC GCC 
met ser ala val ala ala 

TCA ACC GAA ACG ACC GAG 
ser thr glu thr thr glu 

GAC CTG CCC GGC GAG CTG 
asp leu pro gly glu leu 



SEQ ID NOS:357-358 

FIG. 16B 



SEQ ID NO: 359 33/11 



CGC ATG 


CCG 


ACC 


AGT 


GTG 


GTT 


GGC 


CGG 


AGT 


TCG TTT 


GTT 


CGC 


GAT 


TGC 


CTC 


AAC 


GAT 


TCG 


(arg met 


pro 


thr 


ser 


val 


val 


gly 


arg 


ser 


ser phe 


val 


arg 


asp 


cys 


leu 


asn 


asp 


ser 


63/21 


SEQ ID NO: 


360 








93/31 


















ATA TAA 


CCA 


CTC 


TAG 


TCA 


CAT 


CAA 


CCA 


CAC 


TCG TAC 


CAT 


CGA 


GCG 


TGT 


GGG 


TTC 


ATG 


CCA 


ile)OCH 


pro 


leu 


AMB ( ser 


his 


gin 


pro 


his 


ser tyr 


his 


arg 


ala 


cys 


gly 


phe 


met 


pro 


123/41 








SEQ ID NO: 


926 


153/51 


















TGC ATT 


CGC 


GAC 


CGC 


GGG 


AGC 


CGG 


CGA 


ACC 


CGG CGC 


CAC 


ACA 


TAA 


TCC 


AGA 


TTG 


AGG 


AGA 


cys ile 


arg 


asp 


arg 


gly 


ser 


arg 


arg 


thr 


arg arg 


his 


thr)OCH(ser 


arg 


leu 


arg 


arg 


183/61 


















213/71 








— SEQ ID 


NO: 


927 




CTT CCG 


TGC 


CGA 


ACC 


GAC 


GCC 


GAC 


GCA 


AGC 


TTT CGA 


CAG 


CCA 


TGA 


GCG 


CGG 


TCG 


CCG 


CCC 


leu pro 


cys 


arg 


thr 


asp 


ala 


asp 


ala 


ser 


phe arg 


gin 


pro)OPA( 


ala 


arg 


ser 


pro 


pro 


243/81 


















273/91 






SEQ ID 


NO: 


928 




TGG CAG 


TTG 


CAA 


GTC 


CTT 


GTG 


CAT 


ATT 


TTC 


TTG TCT 


ACG 


AAT 


CAA 


CCG 


AAA 


CGA 


CCG 


AGC 


trp gin 


leu 


gin 


val 


leu 


val 


his 


ile 


phe 


leu ser 


thr 


asn 


gin 


pro 


lys 


arg 


pro 


ser 


303/101 


















333/111 


















GGC CCG 


AGC 


ACC 


ATG 


AAT 


TCA 


AGC 


AGG 


CGG 


CGG TGT 


TGA 


CCG 


ACC 


TGC 


CCG 


GCG 


AGC 


TGA 


gly pro 


ser 


thr 


met 


asn 


ser 


ser 


arg 


arg 


arg cys)OPA(pro 


thr 


cys 


pro 


ala 


ser >OPA 


363/121 


















393/131 




1 — -SEQ ID NO 


: 929 






TGT CCG 


CGC 


TAT 


CGC 


AGG 


GGT 


TGT 


CCC 


AGT 


TCG GGA 


TC 
















(cys pro 


arg 


tyr 


arg 


arg 


gly 


cys 


pro 


ser 


ser gly) 

















SEQ ID NO: 930 



SEQ ID NOS : 359-360, 926-930 



FIG. 16C 






CAC 


CGA 


TCA 


GTC 


GAT 






(ala gly 


his 


arg 


ser 


Val 


asp 


arg 


Val 


D JL / A 1 


-SEQ ID 


wo : 










VjL-A bib 


Apr; 








CACj 


CjAC 


bLG 


did va 1 


thr 


thr 


pro 


ala 


gin 


asp 


aia 


tot / yi 1 


















CCT 


TTT 


• 

TTT 


CA 1 


CAA 


CGT 


TGG 


giy ser 


pro 


phe 


phe 


his 


gin 


arg 


trp 


181/61 
















TTC GGT 


CAC 


CTG 


ACG 


GTC 


GGG 


ATG 


TTC 


phe gly 


his 


leu 


thr 


val 


gly 


met 


phe 


241/81 
















TGC CCT 


GCT 


GGT 


GCG 


GCG 


TTC 


GGC 


CGA 


cys pro 


ala 


gly 


ala 


ala 


phe 


gly 


arg 


301/101 
















GTC GAT 


GGC 


CCT 


CAA 


CTC 


GGC 


ATC 


GCG 


val asp 


gly 


pro 


gin 


leu 


gly 


ile 


ala 



31/11 

GTT TCC GCT CCA TCA GCC CGG AAT TGA GGT GCC 
val ser ala pro ser ala arg asn) OPA (gly^ala 

91/31 SEQ ID NO: 363^ 
CCG TTG GTG TTT CCC TCT GTT GCT TTC CCG TCC 
pro leu val phe pro ser val ala phe pro ser 
151/51 

ACT GCC GCA GTG GCG ATG TTG GTC GCC GGC GTG 
thr ala ala val ala met leu val ala gly val 
211/71 

CTT GGG TCT CGG GTT GCT GCT GGG TTT GCT CAA 
leu gly ser arg val ala ala gly phe ala gin 
271/91 

GTC GAT CAC CGC CAA AGA GCA CCC GTT AAA ACG 
val asp his arg gin arg ala pro val lys thr 
331/111 

ACT GGC GAT TAT CAC CAT GCC TCG GGC TGA TC 
thr gly asp tyr his his ala ser gly)OPA 



SEQ ID NOS: 361-363 

FIG. 17A 



0 — SEQ ID NO: 364 



CGG GCC 


ACC 


GAT 


CAG 


TCG 


ATC 


GGG 


TGG 


TTT 


(arg ala 


thr 


asp 


gin 


ser 


ile 


gly 


trp 


phe 


62/21^SEQ ID NO: 365 










CAG TGA 


CGA 


CAC 


CAG 


CGC 


AGG 


ACG 


CGC 


CGT 


gln)OPA(arg 


his 


gin 


arg 


arg 


thr 


arg 


arg 


122/41 


— SEQ ID NO: 366 








GCT CGC 


CTT 


TTT 


TTC 


ATC 


AAC 


GTT 


GGA 


CTG 


ala arg 


leu 


phe 


phe 


ile 


asn 


val 


gly 


leu 


182/61 


















TCG GTC 


ACC 


TGA 


CGG 


TCG 


GGA 


TGT 


TCC 


TTG 


ser val 


thr)OPA(arg 


ser 


gly 


cys 


ser 


leu 


242/81 






^-SEQ ID NO: 


: 367 




GCC CTG 


CTG 


GTG 


CGG 


CGT 


TCG 


GCC 


GAG 


TCG 


ala leu 


leu 


val 


arg 


arg 


ser 


ala 


glu 


ser 


302/101 


















TCG ATG 


GCC 


CTC 


AAC 


TCG 


GCA 


TCG 


CGA 


CTG 


ser met 


ala 


leu 


asn 


ser 


ala 


ser 


arg 


leu 



SEQ ID 



32/11 

CCG CTC CAT CAG CCC GGA ATT GAG GTG CCG 
pro leu his gin pro gly ile glu val pro 
92/31 

TGG TGT TTC CCT CTG TTG CTT TCC CGT CCG 
trp cys phe pro leu leu leu ser arg pro 
152/51 

CCG CAG TGG CGA TGT TGG TCG CCG GCG TGT 
pro gin trp arg cys trp ser pro ala cys 
212/71 

GGT CTC GGG TTG CTG CTG GGT TTG CTC AAT 
gly leu gly leu leu leu gly leu leu asn 
272/91 

ATC ACC GCC AAA GAG CAC CCG TTA AAA CGG 
ile thr ala lys glu his pro leu lys arg 
332/111 

GCG ATT ATC ACC ATG CCT CGG GCT GAT C 
ala ile ile thr met pro arg ala asp) 



;364-367 



FIG. 17B 



^-SEQ ID NO: 368 33/11 



GGG CCA 


CCG 


ATC 


AGT 


CGA 


TCG 


GGT 


GGT 


TTC 


CGC TCC 


ATC 


AGC 


CCG 


GAA 


TTG 


AGG 


TGC 


CGC 


(gly pro 


pro 


ile 


ser 


arg 


ser 


gly 


gly 


phe 


arg ser 


ile 


ser 


pro 


glu 


leu 


arg 


cys 


arg 


63/21^- 


SEQ 


ID NO: 369 










93/31 


















AGT GAC 


GAC 


ACC 


AGC 


GCA 


GGA 


CGC 


GCC 


GTT 


GGT GTT 


TCC 


CTC 


TGT 


TGC 


TTT 


CCC 


GTC 


CGG 


ser asp 


asp 


thr 


ser 


ala 




arg 


ala 


val 


gly val 


ser 


leu 


cys 


cys 


phe 


pro 


val 


arg 


123/41 


















153/51 


















CTC GCC 


TTT 


TTT 


TCA 


TCA 


ACG 


TTG 


GAC 


TGC 


CGC AGT 


GGC 


GAT 


GTT 


GGT 


CGC 


CGG 


CGT 


GTT 


leu ala 


phe 


phe 


ser 


ser 


thr 


leu 


asp 


cys 


arg ser 


gly 


asp 


val 


gly 


arg 


arg 


arg 


val 


183/61 


















213/71 


















CGG TCA 


CCT 


GAC 


GGT 


CGG 


GAT 


GTT 


CCT 


TGG 


GTC TCG 


GGT 


TGC 


TGC 


TGG 


GTT 


TGC 


TCA 


ATG 


arg ser 


pro 


asp 


gly 


arg 


as P 


val 


pro 


trp 


val ser 


gly 


cys 


cys 


trp 


val 


cys 


ser 


met 


243/81 


















273/91 


















CCC TGC 


TGG 


TGC 


GGC 


GTT 


CGG 


CCG 


AGT 


CGA 


TCA CCG 


CCA 


AAG 


AGC 


ACC 


CGT 


TAA 


AAC 


GGT 


pro cys 


trp 


cys 


gly 


val 


arg 


pro 


ser 


arg 


ser pro 


pro 


lys 


ser 


thr 


arg)OCH(asn 


gly 


303/101 


















333/111 




SEQ 


ID NO: 370 — 








CGA TGG 


CCC 


TCA 


ACT 


CGG 


CAT 


CGC 


GAC 


TGG 


CGA TTA 


TCA 


CCA 


TGC 


CTC 


GGG 


CTG 


ATC 




arg trp 


pro 


ser 


thr 


arg 


his 


arg 


asp 


trp 


arg leu 


ser 


pro 


cys 


leu 


gly 


leu 


ile) 



SEQ ID NOS: 368-370 

FIG. 1 7C 



part of the nucleotide sequence of seq!7A 
1/1_^SEQ ID NO: - 371 31/11 

(ggc tag aac ccc gaa gga gac etc gcg ggt tgc egg ccc ccg gec cat egg atg cgt ate 
gly AMB (asn pro glu gly asp leu ala gly cys arg pro pro ala his arg met arg ile 
61/21 6 SEQ ID NO: 372 91/31 

egg teg cgc cga ttc acg acc gac ata ggg age tac ccc ttg ggt gat tec ggt gcg acg 
arg ser arg arg phe thr thr asp ile gly ser tyr pro leu gly asp ser gly ala thr 
121/41 151/51 

act gcg ata cgc teg gcg ggc cac cga tea gtc gat egg gtg gtt tec get cca tea gee 
thr ala ile arg ser ala gly his arg ser val asp arg val val ser ala pro ser ala 
181/61 211/71 

egg aat tga ggt gec gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct 
arg asn)OPA(gly ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser 
241/81 ^ SEQ ID NO: 373 271/91 

gtt get ttc cgt ccg gtt cgc ctt ttt ttc ate aac gtt gga ctg gee gca gtg gcg atg 
val ala phe arg pro val arg leu phe phe ile asn val gly leu ala ala val ala met 
301/101 331/111 

ttg gtc gee ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg 
leu val ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu 
361/121 391/131 

ctg ggt ttg etc aat gec ctg ctg gtg egg cgt teg gec gag teg ate acc gee aaa gag 
leu gly leu leu asn ala leu leu val arg arg ser ala glu ser ile thr ala lys glu 
421/141 451/151 

cac ccg tta aaa egg teg atg gee etc aac teg gca teg cga ctg gcg att ate acc ate 
his pro leu lys arg ser met ala leu asn ser ala ser arg leu ala ile ile thr ile 
481/161 

etc ggg ctg ate 
leu gly leu ile) 



SEQ ID NOS: 371-373 



if FIG. 17 A' 



1 / 1 ^ SEQ ID 


NO: 


374 












31/11 


















get aga 


acc 


ccg 


aag 


gag 


acc 


teg 


egg 


gtt 


gee ggc 


ccc 


egg 


ccc 


acc 


gga 


tgc 


g Ua 


tec 


(ala arg 


thr 


pro 


lys 


glu 


thr 


ser 


arg 


val 


ala g ly 


pro 


arg 


pro 


-i 1 a 

lie 


gly 


cys 


Val 


ser 


61/21^SEQ ID NO: 375 










Q 1 / ~* 1 


















ggt cgc 


gec 


gat 


tea 


cga 


ccg 


aca 


tag 


gga 


CfC t clCC 


cc t 


tgg 


gtg 


a L C 


ccg 


gtg 


cga 


cga 


gly arg 


ala 


asp 


ser 


arg 


pro 


thr)AMB(gly 


ala thr 


pro 


crp 


V cl 1 


lie 


pro 


Vd 1 


arg 


arg 


19 1 / A1 
1 £ 1 / 11 








SEQ 


ID NO: 376<~* 


ICI /CI 
1 Jl/ Jl 


















ctg cga 


tac 


get 


egg 


egg 


gee 


acc 


gat 


cag 


teg ate 


ggg 


tgg 


4— t- 

c t u 


ccg 


etc 


cat 


cag 


ccc 


leu arg 


tyr 


ala 


arg 


arg 


ala 


thr 


asp 


gin 


ser ile 


gly 


trp 


pne 


pro 


leu 


nis 


gin 


pro 


181/61 


















Oil / T 1 

A 11 / / I 


















gga att 


gag 


gtg 


ccg 


cag 


tga 


cga 


cac 


cag 


cgc agg 


acg 


cgc 


cgt 


tgg 


tgt 


ttc 


cc t 


c tg 


gly ile 


glu 


val 


pro 


gln)OPA(arg 


his 


gin 


arg arg 


tnr 


arg 


arg 


trp 


cys 


phe 


pro 


leu 


241/81 


SEQ ID NO: 


377- ' 








TT1 /Ql 


















ttg ctt 


tec 


gtc 


egg 


ttc 


gee 


ttt 


ttt 


tea 


tea acg 


ttg 


gac 


tgg 


ccg 


cag 


tgg 


cga 


tgt 


leu leu 


ser 


val 


arg 


phe 


ala 


phe 


phe 


ser 


ser thr 


leu 


asp 


trp 


pro 


g in 


trp 


arg 


cys 


301/101 


















331/111 


















tgg teg 


ccg 


gcg 


tgt 


teg 


gtc 


acc 


tga 


egg 


teg gga 


tgt 


tct 


tgg 


gtc 


teg 


ggt 


tgc 


tgc 


trp ser 


pro 


ala 


cys 


ser 


val 


thr)OPA(arg 


ser gly 


cys 


ser 


trp 


val 


ser 


gly 


cys 


cys 


361/121 






SEQ ID NO: 


378 > 




391/131 


















tgg gtt 


tgc 


tea 


atg 


ccc 


tgc 


tgg 


tgc 


ggc 


gtt egg 


ccg 


agt 


cga 


tea 


ccg 


cca 


aag 


age 


trp val 


cys 


ser 


met 


pro 


cys 


trp 


cys 


gly 


val airg 


pro 


ser 


arg 


ser 


pro 


pro 


lys 


ser 


421/141 


















451/151 


















acc cgt 


taa 


aac 


ggt 


cga 


tgg 


ccc 


tea 


act 


egg cat 


cgc 


gac 


tgg 


cga 


tta 


tea 


cca 


tec 


thr arg)OCH{asn 


gly 


arg 


trp 


pro 


ser 


thr 


arg his 


arg 


asp 


trp 


arg 


leu 


ser 


pro 


ser 



481/161 * — SEQ ID NO: 379 

teg ggc tga tc 
ser gly)OPA 

SEQ ID NOS: 374-379 



FIG. 17B' 



1 / 1 ^ SEQ ID NO: 380 










31/11 


















eta gaa 


ccc 


cga 


agg 


aga 


cct 


cgc 


ggg 


ttg 


ccg gee 


ccc 


ggc 


cca 


teg 


gat 


gcg 


tat 


ccg 


leu glu 


pro 


arg 


arg 


arg 


pro 


arg 


gly 


leu 


pro ala 


pro 


gly 


pro 


ser 


asp 


ala 


tyr 


pro 


61/21^SEQ ID NO: 381 










91/31 


















gtc gcg 


ccg 


att 


cac 


gac 


cga 


cat 


agg 


gag 


eta ccc 


ctt 


ggg 


tga 


ttc 


egg 


tgc 


gac 


gac 


val ala 


pro 


ile 


his 


asp 


arg 


his 


arg 


glu 


leu pro 


leu 


gly)OPA(phe 


arg 


cys 


asp 


asp 


121/41 


















151/51 








^-SEQ ID NO: 


382 


tgc gat 


acg 


etc 


ggc 


ggg 


cca 


ccg 


ate 


agt 


cga teg 


ggt 


ggt 


ttc 


cgc 


tec 


ate 


age 


ccg 


cys asp 


thr 


leu 


gly 


gly 


pro 


pro 


ile 


ser 


arg ser 


gly 


gly 


phe 


arg 


ser 


ile 


ser 


pro 


181/61 


















211/71 


















gaa ttg 


agg 


tgc 


cgc 


agt 


gac 


gac 


acc 


age 


gca gga 


cgc 


gec 


gtt 


ggt 


gtt 


tec 


etc 


tgt 


glu leu 


arg 


cys 


arg 


ser 


asp 


asp 


thr 


ser 


ala gly 


arg 


ala 


val 


gly 


val 


ser 


leu 


cys 


241/81 


















271/91 


















tgc ttt 


ccg 


tec 


ggt 


teg 


cct 


ttt 


ttt 


cat 


caa cgt 


tgg 


act 


ggc 


cgc 


agt 


ggc 


gat 


gtt 


cys phe 


pro 


ser 


gly 


ser 


pro 


phe 


phe 


his 


gin arg 


trp 


thr 


gly 


arg 


ser 


gly 


asp 


val 


301/101 


















331/111 


















ggt cgc 


egg 


cgt 


gtt 


egg 


tea 


cct 


gac 


ggt 


egg gat 


gtt 


ctt 


ggg 


tct 


egg 


gtt 


get 


get 


gly arg 


arg 


arg 


val 


arg 


ser 


pro 


asp 


gly 


arg asp 


val 


leu 


gly 


ser 


arg 


val 


ala 


ala 


361/121 


















391/131 


















ggg ttt 


get 


caa 


tgc 


cct 


get 


ggt 


gcg 


gcg 


ttc ggc 


cga 


gtc 


gat 


cac 


cgc 


caa 


aga 


gca 


gly; phe 


- a>la - 


gin. ■■cys 


■pro- 


ala 


gly. ala 


ala 


phe gly arg. 


val 


.asp 


-his 


.arg 


gin 


arg 


-ala^ 


4217141 


















451/151 


















ccc, gtt 


aaa 


acg 


gtc 


gat 


ggc 


cct 


caa 


etc 


ggc ate 


gcg 


act 


ggc 


gat 


tat 


cac * 


cat 


cct 


pro val 


lys 


thr 


val 


asp 


gly 


pro 


gin 


leu 


gly ile 


ala 


thr 


gly 


asp 


tyr 


his 


his' 


pro 


481/161 




































egg get 


gat 


c 
































arg ala 


asp) 

































SEQ ID NOS:380-382 

FIG. 17C 



sequence Rvl303 predicted by Cole et al . (Nature 393:537-544) and partially 
containing Seql7A* 



1/1_ SEQ ID NO: 383 










31/11 


















a eg acg 


aca 


cca 


gcg 


cag 


gac 


gcg 




u eg 


gtg ttt 




tc t 


g 1 1 


get 


ttc 


cgt 




gt t 


(met thr 


thr 


pro 


aj.a 


gin 


asp 


ala 


pro 


leu 


v ax £s a a v— 


pro 


ser 


val 


ala 


phe 


arg 


pro 


va 1 




— SEQ ID 


NU » 


.3 O* 










j1/ Jl 


















cgc ctt 






ate 


aac 


gec 


gga 


ctg 


gec 


y v_d. y uy 


gcg 






gtc 


gec 


ggc 


gtg 


ttc 


arg leu 


phe 


phe 


ne 


asn 


val 


y A y 


leu 


ala 


GLJLGl VOX 


ala 




leu 


val 


ala 


gly 


val 


phe 




















151/51 




















ctg 


acg 


gtc 


999 


a tg 


ttc 


u i-y 


yy L 


yyy 


ttg 




c t~ cr 
v- uy 


era t~ 

yy 


<- v-y 


etc 


aat 


gec 


y ly nis 


leu 


thr 


VdJL 


giy 


me t 


phe 


leu 


y J-y 


leu gly 


1 eu 


leu 


leu 




leu 


leu 


asn 


ala 


181/61 


















211/ 7 1 


















ty t-y 


gtg 


egg 


eg t 


teg 


gec 


cracr 


teg 


ate 


acc gee 


aaa 


ciacr 

y *-»-y 


cac 


cca 


tta 


aaa 


ccrcr 

, — y y 


teg 


leu leu 


val 


arg 


arg 


ser 


ala 


glu 


ser 


ile 


thr ala 


lvs 


crlu 


his 


pro 


leu 


lvs 


arcf 


ser 


241/81 


















271/91 


















atg gec 


etc 


aac 


teg 


gca 


teg 


cga 


ctg 


gcg 


att ate 


acc 


ate 


etc 


ggg 


ctg 


ate 


ate 


gee 


met ala 


leu 


asn 


ser 


ala 


ser 


arg 


leu 


ala 


ile ile 


thr 


ile 


leu 


gly 


leu 


ile 


ile 


ala 


301/101 


















331/111 


















tac att 


ttc 


egg 


ccc 


get 


gga 


ttg 


ggc 


gtc 


gtg ttc 


ggg 


ctg 


gca 


ttc 


ttc 


cag 


gtg 


ctg 


tyr ile 


phe 


arg 


pro 


ala 


gly 


leu 


gly 


val 


val phe 


gly 


leu 


ala 


phe 


phe 


gin 


val 


leu 


361/121 


















391/131 


















ctg gtg 


gca 


acg 


acg 


gec 


ctg 


ccg 


gtc 


ctg 


aag aag 


ctg 


cgc 


act 


gcg 


acc 


gag 


gaa 


ccg 


leu val 


ala 


thr 


thr 


ala 


leu 


pro 


val 


leu 


lys lys 


leu 


arg 


thr 


ala 


thr 


glu 


glu 


pro 


421/141 


















451/151 


















gtc gca 


act 


tat 


tct 


tec 


aat 


ggc 


cag 


acc 


ggg gga 


teg 


gaa 


gga 


agg 


age 


gee 


age 


gat 


val ala 


thr 


tyr 


ser 


ser 


asn 


gly 


gin 


thr 


gly gly 


ser 


glu 


gly 


arg 


ser 


ala 


ser 


asp 



481/161 
gac tga 
asp)OPA 

SEQ XD NOS: 383-384 



FIG. 17D 



Orf according to Cole et al . (Nature 39~ 
1^1^-SEQ ID NO: 385 

tga ggt gee gca gtg acg aca cca gcg cag 
OPA(gly ala ala val thr thr pro ala gin 
61/21*" SEQ ID NO: 386 

ttc cgt ccg gtt cgc ctt ttt ttc ate aac 
phe arg pro val arg leu phe phe ile asn 
121/41 

gec ggc gtg ttc ggt cac ctg acg gtc ggg 
ala gly val phe gly his leu thr val gly 
181/61 

ttg etc aat gee ctg ctg gtg egg cgt teg 
leu leu asn ala leu leu val arg arg ser 
241/81 

tta aaa egg teg atg gec etc aac teg gca 
leu lys arg ser met ala leu asn ser ala 
301/101 

ctg ate ate gee tac att ttc egg ccc get 
leu ile ile ala tyr ile phe arg pro ala 
361/121 

ttc cag gtg ctg ctg gtg gca acg acg gee 
phe gin val leu leu val ala thr thr ala 
421/141 

acc gag gaa ccg gtc gca act tat tct tec 
thr glu glu pro val ala thr tyr ser ser 
481/161 

age gee age gat gac tga 
ser ala ser asp asp)OPA 

SEQ X 



: 537-544) and containing Rvl303 



31/11 


















gac gcg 


ccg 


ttg 


gtg 


ttt 


ccc 


tct 


gtt 


get 


asp ala 


pro 


leu 


val 


phe 


pro 


ser 


val 


ala 


91/31 


















gtt gga 


ctg 


gec 


gca 


gtg 


gcg 


atg 


ttg 


gtc 


val gly 


leu 


ala 


ala 


val 


ala 


met 


leu 


val 


151/51 


















atg ttc 


ttg 


ggt 


etc 


ggg 


ttg 


ctg 


ctg 


ggt 


met phe 


leu 


gly 


leu 


giy 


leu 


leu 


leu 


giy 


211/71 


















gee gag 


teg 


ate 


acc 


gec 


aaa 


gag 


cac 


ccg 


ala glu 


ser 


ile 


thr 


ala 


lys 


glu 


his 


pro 


271/91 


















teg cga 


ctg 


gcg 


att 


ate 


acc 


ate 


etc 


ggg 


ser arg 


leu 


ala 


ile 


ile 


thr 


ile 


leu 


giy 


331/111 


















gga ttg 


ggc 


gtc 


gtg 


ttc 


ggg 


ctg 


gca 


ttc 


gly leu 


gly 


val 


val 


phe 


giy 


leu 


ala 


phe 


391/131 


















ctg ccg 


gtc 


ctg 


aag 


aag 


ctg 


cgc 


act 


gcg 


leu pro 


val 


leu 


lys 


lys 


leu 


arg 


thr 


ala 


451/151 


















aat ggc 


cag 


acc 


ggg 


gga 


teg 


gaa 


gga 


agg 


asn gly 


gin 


thr 


gly 


giy 


ser 


glu 


giy 


arg 



NOS:385-386 



FIG. 17E 



MAR 2 6 2B03 ofi] 



SEQ ID NO: 387 31/11 

GTC GAA CAG GTA CGG AAG GCG CCG TCG GTC GCT CGG TCC GCT GGT ATC TCG TGT TCA GCC 
(yalglu gin val arg lys ala pro ser val ala arg ser ala gly ile ser cys ser ala 
61/21 "SEQ ID NO: 388 91/31 

AGC CAG CGG CCG TTA ACG TGG CCG AAC AGG TCG TCT TGG GGT CGG GCA TCA GCG TCG ATG 
ser gin arg pro leu thr trp pro asn arg ser ser trp gly arg ala ser ala ser met 
121/41 151/51 

TGG CTC AGG TCG ATA CCC GAG GGG ATG GCA AGT GTC ACC CCG CCA TCC TTC CAC CTC TTT 
trp leu arg ser ile pro glu gly met ala ser val thr pro pro ser phe his leu phe 
181/61 211/71 

TCG GGT GCA ACG ATC GGG CCA TGC CTG ACG GGG AGC AGA GCC AGC CAC CGG CCC AAG AAG 
ser gly ala thr ile gly pro cys leu thr gly ser arg ala ser his arg pro lys lys 
241/81 271/91 

ATG CGG AAG ACG ACT CGC GGC CCG ACG CCG CGG AGG CCG CCG CGG CCG AAC CCA AAT CAT 
met arg lys thr thr arg gly pro thr pro arg arg pro pro arg pro asn pro asn his 
301/101 331/111 

CAG CCG GTC CCG ATG TTC TCG ACC TAC GGT ATC GCC TCG ACA CTA CTC GGC GTG CTA TCG 
gin pro val pro met phe ser thr tyr gly ile ala ser thr leu leu gly val leu ser 
361/121 

GTC GCC GCG GTC GTG CTG GGT GCG ATG ATC 
val ala ala val val leu gly ala met ile) 

SEQ ID NOS: 387-388 

FIG.18A 



SEQ ID NO: 389 



TCG AAC 


AGG 


TAC 


GGA 


AGG 


CGC 


CGT 


CGG 


TCG 


( ser asn 


arg 


tyr 


gly 


arg 


arg 


arg 


arg 


ser 


62721 


— SEQ ID 


NO: 


390 










GCC AGC 


GGC 


CGT 


TAA 


CGT 


GGC 


CGA 


ACA 


GGT 


ala ser 


gly 


arg)OCH(arg 


gly 


arg 


thr 


gly 


122/41 








^-SEQ ID NO: 391 


GGC TCA 


GGT 


CGA 


TAC 


CCG 


AGG 


GGA 


TGG 


CAA 


gly ser 


gly 


arg 


tyr 


pro 


arg 


gly 


trp 


gin 


182/61 


















CGG GTG 


CAA 


CGA 


TCG 


GGC 


CAT 


GCC 


TGA 


CGG 


arg val 


gin 


arg 


ser 


gly his 


ala)OPA(arg 


242/81 




SEQ ID NO: 


: 392 






TGC GGA 


AGA 


CGA 


CTC 


GCG 


GCC 


CGA 


CGC 


CGC 


cys gly 


arg 


arg 


leu 


ala 


ala 


arg 


arg 


arg 


302/101 


















AGC CGG 


TCC 


CGA 


TGT 


TCT 


CGA 


CCT 


ACG 


GTA 


ser arg 


ser 


arg 


cys 


ser 


arg 


pro 


thr 


val 


362/121 


















TCG CCG 


CGG 


TCG 


TGC 


TGG 


GTG 


CGA 


TGA 


TC 


ser pro 


arg 


ser 


cys 


trp 


val 


arg)OPA 





32/11 

CTC GGT CCG CTG GTA TCT CGT GTT CAG CCA 
leu gly pro leu val ser arg val gin pro 
92/31 

CGT CTT GGG GTC GGG CAT CAG CGT CGA TGT 
arg leu gly val gly his gin arg arg cys 
152/51 

GTG TCA CCC CGC CAT CCT TCC ACC TCT TTT 
val ser pro arg his pro ser thr ser phe 
212/71 

GGA GCA GAG CCA GCC ACC GGC CCA AGA AGA 
gly ala glu pro ala thr gly pro arg arg 
272/91 

GGA GGC CGC CGC GGC CGA ACC CAA ATC ATC 
gly gly arg arg gly arg thr gin ile ile 
332/111 

TCG CCT CGA CAC TAC TCG GCG TGC TAT CGG 
ser pro arg his tyr ser ala cys tyr arg 



SEQ ZD NOS: 389-392 



FIG. 18B 



3/1^^ SEQ ID NO: 393 

CGA AC A GGT ACG GAA GGC GCC GTC GGT CGC 
(argthr gly thr glu gly ala val gly arg 
6*5/21 ^SEQ ID NO: 394 

CCA GCG GCC GTT AAC GTG GCC GAA CAG GTC 
pro ala ala val asn val ala glu gin val 
123/41 

GCT CAG GTC GAT ACC CGA GGG GAT GGC AAG 
ala gin val asp thr arg gly asp gly lys 
183/61 

GGG TGC AAC GAT CGG GCC ATG CCT GAC GGG 
gly cys asn asp arg ala met pro asp gly 
243/81 

GCG GAA GAC GAC TCG CGG CCC GAC GCC GCG 
ala glu asp asp ser arg pro asp ala ala 
303/101 

GCC GGT CCC GAT GTT CTC GAC CTA CGG TAT 
ala gly pro asp val leu asp leu arg tyr 
363/121 

CGC CGC GGT CGT GCT GGG TGC GAT GAT C 
arg arg gly arg ala gly cys asp asp) 



33/11 

TCG GTC CGC TGG TAT CTC GTG TTC AGC CAG 
ser val arg trp tyr leu val phe ser gin 
93/31 

GTC TTG GGG TCG GGC ATC AGC GTC GAT GTG 
val leu gly ser gly ile ser val asp val 
153/51 

TGT CAC CCC GCC ATC CTT CCA CCT CTT TTC 
cys his pro ala ile leu pro pro leu phe 
213/71 

GAG CAG AGC CAG CCA CCG GCC CAA GAA GAT 
glu gin ser gin pro pro ala gin glu asp 
273/91 

GAG GCC GCC GCG GCC GAA CCC AAA TCA TCA 
glu ala ala ala ala glu pro lys ser ser 
333/111 

CGC CTC GAC ACT ACT CGG CGT GCT ATC GGT 
arg leu asp thr thr arg arg ala ile gly 



SEQ ZD NOS: 393-394 

FIG. 18C 



part of the nucleotide sequence of seq!8A 
1/^— SEQ ID NO: 395 31/11 

GAA GGC GCC GTC GGT CGC TCG GTC CGC TGG TAT CTC GTG TTC AGC CAG CCA GCG GCC GTT 
( glu gly ala val gly arg ser val arg trp tyr leu val phe ser gin pro ala ala val 
61/21 "SEQ ID NO: 396 91/31 

AAC GTG GCC GAA CAG GTC GTC TTG GGG TCG GGC ATC AGC GTC GAT GTG GCT CAG GTC GAT 
asn val ala glu gin val val leu gly ser gly ile ser val asp val ala gin val asp 
121/41 151/51 

ACC CGA GGG GAT GGC AAG TGT CAC CCC GCC ATC CTT CCA CCT CTT TTC GGG TGC AAC GAT 
thr arg gly asp gly lys cys his pro ala ile leu pro pro leu phe gly cys asn asp 
181/61 211/71 

CGG GCC ATG CCT GAC GGG GAG CAG AGC CAG CCA CCG GCC CAA GAA GAT GCG GAA GAC GAC 
arg ala met pro asp gly glu gin ser gin pro pro ala gin glu asp ala glu asp asp 
241/81 271/91 

TCG CGG CCC GAC GCC GCG GAG GCC GCC GCG GCC GAA CCC AAA TCA TCA GCC GGT CCG ATG 
ser arg pro asp ala ala glu ala ala ala ala glu pro lys ser ser ala gly pro met 
301/101 331/111 

TTC TCG ACC TAC GGT ATC GCC TCG ACA CTA CTC GGC GTG CTA TCG GTC GCC GCG GTC GTG 
phe ser thr tyr gly ile ala ser thr leu leu gly val leu ser val ala ala val val 
361/121 

CTG GGT GCG ATG ATC 
leu gly ala met ile) 



SEQ ID NOS: 395-396 



FIG. 18A" 



— SEQ ID NO: 397 31/11 





GCG 


CCG 


TCG 


GTC 




ccici 

V— VJTVJJ 


-L v_V„ 


HPT 


VjVj J. i\ L \_. 


J. V*Vj 


TGT 


TCA 


ccc 


AGP 


CAG 


CGG 


CCG 


\ glx y x _y o 


ala 


pro 


ser 


val 


ala 


arg 


ser 


ala 


yj-jr j. j. c 


ser 


cys 


ser 


ala 


ser 


gin 


arcr 


pro 




SEQID NO: 398 














91/31 














TGG 


CCG 


AAC 


AGG 


TCG 


TCT 


TGG 


GGT 


CGG GCA 


TCA 


GCG 


TCG 


ATG 


TGG 


CTC 


AGG 


TCG 


Xcu l_ 1 IX 


trp 


pro 


asn 


arg 


ser 


ser 


trn 




aircr ala 


ser 


ala 


ser 


inet 


trn 


leu 


arg 


ser 


i oi / A 1 
X Z X / ft X 


















1 SI / SI 

X -J X / J X 




















GAG 


GGG 


ATG 


GCA 


AGT 


GTC 


ACC 


CCG 


CCA TCC 


TTC 


CAC 


CTC 


TTT 


TCG 


GGT 


GCA 


ACG 


ile pro 


glu 


gly 


met 


ala 


ser 


val 


thr 


pro 


pro ser 


one 


his 


leu 


phe 


ser 


gly 


ala 


thr 


181/61 


















211/71 


















ATC GGG 


CCA 


TGC 


CTG 


ACG 


GGG 


AGC 


AGA 


GCC 


AGC CAC 


CGG 


CCC 


AAG 


AAG 


ATG 


CGG 


AAG 


ACG 


ile gly 


pro 


cys 


leu 


thr 


gly 


ser 


arg 


ala 


ser his 


arg 


pro 


lys 


lys 


met 


arg 


lys 


thr 


241/81 


















271/91 


















ACT CGC 


GGC 


CCG 


ACG 


CCG 


CGG 


AGG 


CCG 


CCG 


CGG CCG 


AAC 


CCA 


AAT 


CAT 


CAG 


CCG 


GTC 


CGA 


thr arg 


gly 


pro 


thr 


pro 


arg 


arg 


pro 


pro 


arg pro 


asn 


pro 


asn 


his 


gin 


pro 


val 


arg 


301/101 


















331/111 


















TGT TCT 


CGA 


CCT 


ACG 


GTA 


TCG 


CCT 


CGA 


CAC 


TAC TCG 


GCG 


TGC 


TAT 


CGG 


TCG 


CCG 


CGG 


TCG 


cys ser 


arg 


pro 


thr 


val 


ser 


pro 


arg 


his 


tyr ser 


ala 


cys 


tyr 


arg 


ser 


pro 


arg 


ser 



361/121 

TGC TGG GTG CGA TGA TC 
cys trp val arg)OPA 



SEQ ID NOS: 397-398 

FIG. 18B" 



1/1^ SEQ ID NO: 399 










31/11 


















GGA AGG 


CGC 


CGT CGG TCG 


CTC 


GGT 


CCG 


CTG 


GTA TCT 


CGT 


GTT 


CAG 


CCA 


GCC 


AGC 


GGC 


CGT 


(gly arg 
61/21 ^ 


arg 


arg arg ser 


leu 


gly 


pro 


leu 


val ser 


arg 


val 


gin 


pro 


ala 


ser 


gly 


arg 


SEQ 


ID NO: 400 










91/31 


















TAA CGT 


GGC 


CGA ACA GGT 


CGT 


CTT 


GGG 


GTC 


GGG CAT 


CAG 


CGT 


CGA 


TGT 


GGC 


TCA 


GGT 


CGA 


OCH(arg 


gly 


arg thr gly 


arg 


leu 


gly 


val 


gly his 


gin 


arg 


arg 


cys 


gly 


ser 


gly 


arg 


121/41^ 


SEQ 


ID NO: 401 










151/51 


















TAC CCG 


AGG 


GGA TGG CAA 


GTG 


TCA 


CCC 


CGC 


CAT CCT 


TCC 


ACC 


TCT 


TTT 


CGG 


GTG 


CAA 


CGA 


tyr pro 


arg 


gly trp gin 


val 


ser 


pro 


arg 


his pro 


ser 


thr 


ser 


phe 


arg 


val 


gin 


arg 


181/61 














211/71 


















TCG GGC 


CAT 


GCC TGA CGG 


GGA 


GCA 


GAG 


CCA 


GCC ACC 


GGC 


CCA 


AGA 


AGA 


TGC 


GGA 


AGA 


CGA 


ser gly 


his 


ala)OPA(arg 


gly 


ala 


glu 


pro 


ala thr 


gly 


pro 


arg 


arg 


cys 


gly 


arg 


arg 


241/81 




^SEQ ID NO 


: 402 


271/91 


















CTC GCG 


GCC 


CGA CGC CGC 


GGA 


GGC 


CGC 


CGC 


GGC CGA 


ACC 


CAA 


ATC 


ATC 


AGC 


CGG 


TCC 


GAT 


leu ala 


ala 


arg arg arg 


gly 


gly 


arg 


arg 


gly arg 


thr 


gin 


ile 


ile 


ser 


arg 


ser 


asp 


301/101 














331/111 


















GTT CTC 


GAC 


CTA CGG TAT 


CGC 


CTC 


GAC 


ACT 


ACT CGG 


CGT 


GCT 


ATC 


GGT 


CGC 


CGC 


GGT 


CGT 


val leu 


asp 


leu arg tyr 


arg 


leu 


asp 


thr 


thr arg 


arg 


ala 


ile 


gly 


arg 


arg 


gly 


arg 


361/121 
































GCT GGG 


TGC 


GAT GAT C 




























ala gly 


cys 


asp asp) 





























SEQ XD NOS: 399-402 



FIG. 18C 



sequence Rv0199 predicted by Cole et al . 
containing seql8A' 



(Nature 393:537-544) and 



1/1^ SEQ ID 
atg cct gac 
(M et p ro asp 
6*1/21 SEQ 
ccc gac gcc 
pro asp ala 
121/41 
acc . tac ggt 
thr tyr gly 
181/61 
gcg atg ate 
ala met ile 
241/81 
atg ctg acc 
met leu thr 
301/101 
gcc age ctg 
ala ser leu 
361/121 
gtc gtg cag 
val val gin 
421/141 
gag gcg gta 
glu ala val 
481/161 
ccg gta gta 
pro val val 
541/181 
gcg acg teg 
ala thr ser 
601/201 
etc gat gtc 
leu asp val 



NO: 403 

ggg gag cag age cag cca ccg 
gly glu gin ser gin pro pro 
ZD NO: 404 

gcg gag gcc gcc gcg gcc gaa 
ala glu ala ala ala ala glu 

ate gcc teg aca eta etc ggc 
ile ala ser thr leu leu gly 

tgg tec gca cac cgc gat gac 
trp ser ala his arg asp asp 

gcc get gaa tgg acg gcc gtg 
ala ala glu trp thr ala val 

cag cga ctg cac gac gga acg 
gin arg leu his asp gly thr 

ccc tac egg cag gtg gtg gag 
pro tyr arg gin val val glu 

gcg ate gat acg gtg cac cgc 
ala ile asp thr val his arg 

acc acg aaa ttg cca ccg ttt 
thr thr lys leu pro pro phe 

gtc agt gag aac gcc ggc gcc 
val ser glu asn ala gly ala 

tec gat gtg gac ggc aag ctg 
ser asp val asp gly lys leu 



31/11 
gcc caa 
ala gin 
91/31 
ccc aaa 
pro lys 
151/51 
gtg eta 
val leu 
211/71 
tec ggc 
ser gly 
271/91 
ctg ate 
leu ile 
331/111 
gtc ggt 
val gly 
391/131 
aag ttg 
lys leu 
451/151 
gag ctg 
glu leu 
511/171 
gcc act 
ala thr 
571/191 
aaa ccc 
lys pro 
631/211 
atg ate 
met ile 



gaa gat 
glu asp 

tea tea 
ser ser 

teg gtc 
ser val 

gag cgt 
glu arg 

aac atg 
asn met 

caa etc 
gin leu 

egg acg 
arg thr 

gat acc 
asp thr 

cgc acc 
arg thr 

cag acc 
gin thr 

tec egg 
ser arg 



gcg 
ala 

gcc 
ala 

gcc 
ala 

acc 
thr 

aac 
asn 

aac 
asn 

cac 
his 

cag 
gin 

gac 
asp 

gtg 
val 

ttg 
leu 



gaa gac 
glu asp 

ggt ccg 
gly pro 

gcg gtc 
ala val 

tac ctg 
tyr leu 

gcc gac 
ala asp 

acc gac 
thr asp 

age age 
ser ser 

tec ggt 
ser gly 

teg gtg 
ser val 

cac tgg 
his trp 

gag teg 
glu ser 



gac 
asp 

atg 
met 

gtg 

val 

acc 
thr 

aac 
asn 

ttc 
phe 

ggc 

gly 

gcc 
ala 

ctg 
leu 

aac 
asn 

att 
ile 



teg egg 
ser arg 

ttc teg 
phe ser 

ctg ggt 
leu gly 

egg gtc 
arg val 

ate gat 
ile asp 

gac get 
asp ala 

agg ate 
arg ile 

gcc cga 
ala arg 

ctg gtc 
leu val 

ttg egg 
leu arg 

cga tga 
arg) OPA 



SEQ ID NOS: 403-404 



FIG. 18D 



ORF according to Cole et al . 



(Nature 393:537-544) and containing Rv0199 



1/1^ SEQ ID 
taa tec gat 
OCH(ser asp 
61/21^^ SEQ 
ate gac ata 
ile asp ile 
121/41 
ggc caa ttc 
gly gin phe 
181/61 
cac ccc tgc 
his pro cys 
241/81 
ctg cgc ate 
leu arg ile 
301/101 
ttg tgt etc 
leu cys leu 
361/121 
tat etc gtg 
tyr leu val 
421/141 
ggc ate age 
gly ile ser 
481/161 
ate ctt cca 
ile leu pro 
541/181 
cca ccg gee 
pro pro ala 
601/201 
gee gaa ccc 
ala glu pro 
661/221 
etc ggc gtg 
leu gly val 
721/241 
gat gac tec 
asp asp ser 
781/261 
gee gtg ctg 
ala val leu 
841/281 
gga acg gtc 
gly thr val 
901/301 
gtg gag aag 
val glu lys 
961/321 
cac cgc gag 
his arg glu 
1021/341 
ccg ttt gee 
pro phe ala 
1081/361 
ggc gee aaa 
gly ala lys 
1141/381 
aag ctg atg 
lys leu met 



NO: 405 

gee gga ttg 
ala gly leu 
ID NO: 406 

gac tec gat 
asp ser asp 

ggt ggc gtc 
gly gly val 

gcg etc gac 
ala leu asp 

ggt gee tac 
gly ala tyr 

ggc gcg gtc 
gly ala val 

ttc age cag 
phe ser gin 

gtc gat gtg 
val asp val 

cct ctt ttc 
pro leu phe 

caa gaa gat 
gin glu asp 

aaa tea tea 
lys ser ser 

eta teg gtc 
leu ser val 

ggc gag cgt 
gly glu arg 

ate aac atg 
ile asn met 

ggt caa etc 
gly gin leu 

ttg egg acg 
leu arg thr 

ctg gat ace 
leu asp thr 

act cgc acc 
thr arg thr 

ccc cag acc 
pro gin thr 

ate tec egg 
ile ser arg 



ggt gaa atg 
gly glu met 

gee gee gec 
ala ala ala 

ggc cgc get 
gly arg ala 

ggc ttc etc 
gly phe leu 

cgc age acc 
arg ser thr 

gaa cag get 
glu gin ala 

cca gcg gec 
pro ala ala 

get cag gtc 
ala gin val 

ggg tgc aac 
gly cys asn 

gcg gaa gac 
ala glu asp 

gee ggt ccg 
ala gly pro 

gee gcg gtc 
ala ala val 

acc tac ctg 
thr tyr leu 

aac gec gac 
asn ala asp 

aac acc gac 
asn thr asp 

cac age age 
his ser ser 

cag tec ggt 
gin ser gly 

gac teg gtg 
asp ser val 

gtg cac tgg 
val his trp 

ttg gag teg 
leu glu ser 



31/11 
cac caa gta acg 
his gin val thr 

91/31 
cac gec ggc acg 
his ala gly thr 

151/51 
gtc aat cgt ggc 
val asn arg gly 

211/71 
gtc gag gaa get 
val glu glu ala 

271/91 
tgc ttg get ggc 
cys leu ala gly 

331/111 
acg gaa ggc gee 
thr glu gly ala 

391/131 
gtt aac gtg gec 
val asn val ala 

451/151 
gat acc cga ggg 
asp thr arg gly 

511/171 
gat egg gee atg 
asp arg ala met 

571/191 
gac teg egg ccc 
asp ser arg pro 

631/211 
atg ttc teg acc 
met phe ser thr 

691/231 
gtg ctg ggt gcg 
val leu gly ala 

751/251 
acc egg gtc atg 
thr arg val met 

811/271 
aac ate gat gec 
asn ile asp ala 

871/291 
ttc gac get gtc 
phe asp ala val 

931/311 
ggc agg ate gag 
gly arg ile glu 

991/331 
gec gec cga ccg 
ala ala arg pro 

1051/351 
ctg ctg gtc gcg 
leu leu val ala 

1111/371 
aac ttg egg etc 
asn leu arg leu 

1171/391 
att cga tga 
ile arg)OPA 



ggt cga gtc ttt gga ate ggt 
gly arg val phe gly ile gly 

ttg cag agt gee aag ggc ggc 
leu gin ser ala lys gly gly 

caa ttc gtc gtg cag egg ttg 
gin phe val val gin arg leu 

ggc gta gag gtc gee gat gcg 
gly val glu val ala asp ala 

ctg gat gat cag gtc teg cac 
leu asp asp gin val ser his 

gtc ggt cgc teg gtc cgc tgg 
val gly arg ser val arg trp 

gaa cag gtc gtc ttg ggg teg 
glu gin val val leu gly ser 

gat ggc aag tgt cac ccc gee 
asp gly lys cys his pro ala 

cct gac ggg gag cag age cag 
pro asp gly glu gin ser gin 

gac gee gcg gag gec gee gcg 
asp ala ala glu ala ala ala 

tac ggt ate gee teg aca eta 
tyr gly ile ala ser thr leu 

atg ate tgg tec gca cac cgc 
met ile trp ser ala his arg 

ctg acc gec get gaa tgg acg 
leu thr ala ala glu trp thr 

age ctg cag cga ctg cac gac 
ser leu gin arg leu his asp 

gtg cag ccc tac egg cag gtg 
val gin pro tyr arg gin val 

gcg gta gcg ate gat acg gtg 
ala val ala ile asp thr val 

gta gta acc acg aaa ttg cca 
val val thr thr lys leu pro 

acg teg gtc agt gag aac gee 
thr ser val ser glu asn ala 

gat gtc tec gat gtg gac ggc 
asp val ser asp val asp gly 



SEQ ID NOS:405-406 



FIG. 18E 



SEQ ID 


NO: 


407 












31/11 






















CGG 


GGT 


GAG 


7\ /I 

LAL 






o a rn 


GAT GGC 


GCA 


ACT 


ATC 


GAA 


ACT 


GCA 


GGA 


CAT 


\VGLJL. CXJ.d 
_ 


gin 


arg 


gly 


glu 


hi s 


arg 


arg 


asp 


asp gly 


ala 


thr 


ile 


glu 


thr 


ala 


giy 


nis 


01/ Zl 




ID NO: 408 










91/31 


















CC^C* C* 21 A 




A A 






LAL 






/-i m 


GGG CAC 


CCC 


TGG 


CTA 


TCA 


GGC 


CAG 


CGT 


CGA 


o J- y y m 


ci x. y 


gin 


arg 


arg 


his 


ser 


arg 


gly 


gly his 


pro 


trp 


leu 


ser 


gly 


gin 


arg 


arg 


191 / A 1 


















151/51 


















J. A lol 


VjO 1 


AAA 


CAC 


ACT 




A A 

LAA 




/-t 


TTT TGA 


TGT 


GCA 


AAC 


CCC 


GGA 


GTT 


CTC 


CGC 


xcu *w _y J3 


y J-y 


lys 


his 


thr 


ala 

aia 


g In 


gin 


arg 


phe) OPA(cys 


ala 


asn 


pro 


gly 


val 


leu 


arg 


181/61 


















211/71 


— SEQ ID NO: 409 








TCG CGT 


GTT 


CAA 


GGC 


CGA 


AAA 


AGG 


GGT 


GGT 


GAC CCT 


CGG 


CGG 


CAA 


CAC 


CGT 


GGA 


GGC 


GAG 


ser arg 


val 


gin 


gly 


arg 


lys 


arg 


gly 


gly 


asp pro 


arg 


arg 


gin 


his 


arg 


gly 


gly 


glu 


241/81 


















271/91 


















GGC GCT 


CGA 


GTA 


CAG 


CCT 


CGG 


CAC 


ACC 


GCC 


GGA CGG 


GGT 


GAC 


GGG 


CCC 


GCT 


GGT 


GGC 


TGC 


gly ala 


arg 


val 


gin 


pro 


arg 


his 


thr 


ala 


gly arg 


gly 


asp 


gly 


pro 


ala 


gly 


gly 


cys 


301/101 


















331/111 


















CCC CGC 


CGA 


CGA 


CAG 


TCC 


GGG 


CTG 


CAG 


TCC 


GTC GGA 


CTA 


CGA 


CAG 


GCT 


GCC 


GGT 


GTC 


CGG 


pro arg 


arg 


arg 


gin 


ser 


gly 


leu 


gin 


ser 


val gly 


leu 


arg 


gin 


ala 


ala 


gly 


val 


arg 



361/121 

TGC GGT GGT GCT GGT AGA TC 
cys gly gly ala gly arg) 



SEQ ID NOS: 407-409 

FIG. 19A 



^ — SEQ ID NO: 410 

TTG CGC AAC GGG GTG AGC ACC GAC GCG ATG 
( leu ar g asn gly val ser thr asp ala met 
6*2/21 ^ — SEQ ID NO: 411 

GCC AAC GCC AAC GAC GGC ACT CGC GCG GTG 
ala asn ala asn asp gly thr arg ala val 
122/41 

TAT GTG GTA AAC ACA CTG CGC AAC AGC GGT 
tyr val val asn thr leu arg asn ser gly 
182/61 

CGC GTG TTC AAG GCC GAA AAA GGG GTG GTG 
arg val phe lys ala glu lys gly val val 
242/81 

GCG CTC GAG TAC AGC CTC GGC ACA CCG CCG 
ala leu glu tyr ser leu gly thr pro pro 
302/101 

CCC GCC GAC GAC AGT CCG GGC TGC AGT CCG 
pro ala asp asp ser pro gly cys ser pro 
362/121 

GCG GTG GTG CTG GTA GAT C 
ala val val leu val asp) 



32/11 

ATG GCG CAA CTA TCG AAA CTG CAG GAC ATC 
met ala gin leu ser lys leu gin asp ile 
92/31 

GGC ACC CCT GGC TAT CAG GCC AGC GTC GAC 
gly thr pro gly tyr gin ala ser val asp 
152/51 

TTT GAT GTG CAA ACC CCG GAG TTC TCC GCT 
phe asp val gin thr pro glu phe ser ala 
212/71 

ACC CTC GGC GGC AAC ACC GTG GAG GCG AGG 
thr leu gly gly asn thr val glu ala arg 
272/91 

GAC GGG GTG ACG GGC CCG CTG GTG GCT GCC 
asp gly val thr gly pro leu val ala ala 
332/111 

TCG GAC TAC GAC AGG CTG CCG GTG TCC GGT 
ser asp tyr asp arg leu pro val ser gly 



SEQ ID NOS: 410-411 



FIG. 19B 



^ SEQ ID NO: 412 ^~SEQ ID NO: 414 33/11 

TGC GCA ACG GGG TGA^GCA CCG ACG CGA TGA TGG CGC AAC TAT CGA AAC TGC AGG ACA TCG 
(cysala thr gly)OPA(ala pro thr arg)OPA(trp arg asn tyr arg asn cys arg thr ser 
63721"" SEQ ID NO: 413 SEQ ID NO: 415 r 93/31 

CCA ACG CCA ACG ACG GCA CTC GCG CGG TGG GCA CCC CTG GCT ATC AGG CCA GCG TCG ACT 
pro thr pro thr thr ala leu ala arg trp ala pro leu ala ile arg pro ala ser thr 
123/41 153/51 

ATG TGG TAA ACA CAC TGC GCA ACA GCG GTT TTG ATG TGC AAA CCC CGG AGT TCT CCG CTC 
met trp)OCH(thr his cys ala thr ala val leu met cys lys pro arg ser ser pro leu 
183/61 ^SEQ ID NO: 416 213/71 

GCG TGT TCA AGG CCG AAA AAG GGG TGG TGA CCC TCG GCG GCA ACA CCG TGG AGG CGA GGG 
ala cys ser arg pro lys lys gly trp)OPA(pro ser ala ala thr pro trp arg arg gly 
243/81 SEQ ID NO: 417 ^ 273/91 

CGC TCG AGT ACA GCC TCG GCA CAC CGC CGG ACG GGG TGA CGG GCC CGC TGG TGG CTG CCC 

arg ser ser thr ala ser ala his arg arg thr gly)OPA(arg ala arg trp trp leu pro 

303/101 333/111 ^-SEQ ID NO: 418 

CCG CCG ACG ACA GTC CGG GCT GCA GTC CGT CGG ACT ACG ACA GGC TGC CGG TGT CCG GTG 

pro pro thr thr val arg ala ala val arg arg thr thr thr gly cys arg cys pro val 

363/121 

CGG TGG TGC TGG TAG ATC 
arg trp cys trp) AMB ile 

SEQ ID NOS: 412-418 

FIG. 19C 



part of the nucleotide sequence of seq!9A 



1/1 SEQ ID NO: 


419 










31/11 


















CTA TCG 


AAA 


CTG 


CAG 


GAC 


ATC 


GCC 


AAC 


GCC 


AAC GAC 


GGC 


ACT 


CGC 


GCG 


GTG 


GGC 


ACC 


CCT 


(leu ser 


lys 


leu 


gin 


asp 


ile 


ala 


asn 


ala 


asn asp 


gly 


thr 


arg 


ala 


val 


gly 


thr 


pro 


61/21^- 


- SEQ ID 


NO: 


420 










91/31 


















GGC TAT 


CAG 


GCC 


AGC 


GTC 


GAC 


TAT 


GTG 


GTA 


AAC ACA 


CTG 


CGC 


AAC 


AGC 


GGT 


TTT 


GAT 


GTG 


gly tyr 


gin 


ala 


ser 


val 


asp 


tyr 


val 


val 


asn thr 


leu 


arg 


asn 


ser 


gly 


phe 


asp 


val 


121/41 


















151/51 


















CAA ACC 


CCG 


GAG 


TTC 


TCC 


GCT 


CGC 


GTG 


TTC 


AAG GCC 


GAA 


AAA 


GGG 


GTG 


GTG 


ACC 


CTC 


GGC 


gin thr 


pro 


glu 


phe 


ser 


ala 


arg 


val 


phe 


lys ala 


glu 


lys 


gly 


val 


val 


thr 


leu 


gly 


181/61 


















211/71 


















GGC AAC 


ACC 


GTG 


GAG 


GCG 


AGG 


GCG 


CTC 


GAG 


TAC AGC 


CTC 


GGC 


ACA 


CCG 


CCG 


GAC 


GGG 


GTG 


gly asn 


thr 


val 


glu 


ala 


arg 


ala 


leu 


glu 


tyr ser 


leu 


gly 


thr 


pro 


pro 


asp 


gly 


val 


241/81 


















271/91 


















ACG GGC 


CCG 


CTG 


GTG 


GCT 


GCC 


CCC 


GCC 


GAC 


GAC AGT 


CCG 


GGC 


TGC 


AGT 


CCG 


TCG 


GAC 


TAC 


thr gly 


pro 


leu 


val 


ala 


ala 


pro 


ala 


asp 


asp ser 


pro 


gly 


cys 


ser 


pro 


ser 


asp 


tyr 


301/101 


















331/111 


















GAC AGG 


CTG 


CCG 


GTG 


TCC 


GGT 


GCG 


GTG 


GTG 


CTG GTA 


GAT 


C 














asp arg 


leu 


pro 


val 


ser 


gly 


ala 


val 


val 


leu val 


asp) 















SEQ XD NO419-420 



FIG. 19A" 



MAR 2 6 2003 8] 



1/ 1 ^ SEQ ID NO: 421 31/11 

'fAT CGA AAC TGC AGG ACA TCG CCA ACG CCA ACG ACG GCA CTC GCG CGG TGG GCA CCC CTG 
( ty r ar g asn cys arg thr ser pro thr pro thr thr ala leu ala arg trp ala pro leu 
61/21^ SEQ ID NO: 422 91/31 

GCT ATC AGG CCA GCG TCG ACT ATG TGG TAA ACA CAC TGC GCA ACA GCG GTT TTG ATG TGC 
ala ile arg pro ala ser thr met trp)OCH(thr his cys ala thr ala val leu met cys 
121/41 SEQ ID NO: 423^151/51 

AAA CCC CGG AGT TCT CCG CTC GCG TGT TCA AGG CCG AAA AAG GGG TGG TGA CCC TCG GCG 
lys pro arg ser ser pro leu ala cys ser arg pro lys lys gly trp)OPA(pro ser ala 
181/61 211/71 SEQ ID NO: 424—^ 

GCA ACA CCG TGG AGG CGA GGG CGC TCG AGT ACA GCC TCG GCA CAC CGC CGG ACG GGG TGA 
ala thr pro trp arg arg gly arg ser ser thr ala ser ala his arg arg thr gly)OPA 
241/81 271/91 

CGG GCC CGC TGG TGG CTG CCC CCG CCG ACG ACA GTC CGG GCT GCA GTC CGT CGG ACT ACG 
(arg ala arg trp trp leu pro pro pro thr thr val arg ala ala val arg arg thr thr 
301/101 —SEQ ID NO: 425 331/111 
ACA GGC TGC CGG TGT CCG GTG CGG TGG TGC TGG TAG ATC 
thr gly cys arg cys pro val arg trp cys trp) AMB ile 



SEQ ID NOS: 421-425 

FIG. 19B' 



1/1^^-^SEQ ID NO: 42 6 

ATC GAA ACT GCA GGA CAT CGC CAA CGC CAA 
( ile glu thr ala gly his arg gin arg gin 
61/21 ^ SEQ ID NO: 427 

CTA TCA GGC CAG CGT CGA CTA TGT GGT AAA 
leu ser gly gin arg arg leu cys gly lys 
121/41 

AAC CCC GGA GTT CTC CGC TCG CGT GTT CAA 
asn pro gly val leu arg ser arg val gin 
181/61 

CAA CAC CGT GGA GGC GAG GGC GCT CGA GTA 
gin his arg gly gly glu gly ala arg val 
241/81 

GGG CCC GCT GGT GGC TGC CCC CGC CGA CGA 
gly pro ala gly gly cys pro arg arg arg 
301/101 

CAG GCT GCC GGT GTC CGG TGC GGT GGT GCT 
gin ala ala gly val arg cys gly gly ala 



31/11 


















CGA CGG 


CAC 


TCG 


CGC 


GGT 


GGG 


CAC 


CCC 


TGG 


arg arg 


his 


ser 


arg 


gly 


gly 


his 


pro 


trp 


91/31 


















CAC ACT 


GCG 


CAA 


CAG 


CGG 


TTT 


TGA 


TGT 


GCA 


his thr 


ala 


gin 


gin 


arg 


phe)OPA(cys 


ala 


151/51 




SEQ ID NO: 


428-^ 






GGC CGA 


AAA 


AGG 


GGT 


GGT 


GAC' 


CCT 


CGG 


CGG 


gly arg 


lys 


arg 


gly 


gly 


asp 


pro 


arg 


arg 


211/71 


















CAG CCT 


CGG 


CAC 


ACC 


GCC 


GGA 


CGG 


GGT 


GAC 


gin pro 


arg 


his 


thr 


ala 


gly 


arg 


gly 


asp 


271/91 


















CAG TCC 


GGG 


CTG 


CAG 


TCC 


GTC 


GGA 


CTA 


CGA 


gin ser 


gly 


leu 


gin 


ser 


val 


gly 


leu 


arg 


331/111 


















GGT AGA 


TC 
















gly arg) 

















SEQ ID NOS : 426-428 



FIG. 19C 



MAR 2 6 2KB 

sequence Rv0418 predicted by Cole et al . (Nature 393:537-544) and 
containing seql9A' 

1/1_^-SEQ ID NO: 429 31/11 

a£g gtg aac aaa tec agg atg atg ccg gcg gtg ctg gec gtg get gtg gtc gtc gca ttc 
( Met va l asn lys ser arg met met pro ala val leu ala val ala val val val ala phe 
61/21 ^SEQ ID NO: 430 91/31 

ctg acg acg ggc tgt ate egg tgg tct acg cag teg egg ccc gtt gtt aac ggc ccc get 
leu thr thr gly cys ile arg trp ser thr gin ser arg pro val val asn gly pro ala 
121/41 151/51 

gec gca gag ttc gee gtt gcg ttg cgc aac egg gtg age acc gac gcg atg atg gcg cac 
ala ala glu phe ala val ala leu arg asn arg val ser thr asp ala met met ala his 
181/61 211/71 

eta teg aaa ctg cag gac ate gec aac gee aac gac ggc act cgc gcg gtg ggc acc cct 
leu ser lys leu gin asp ile ala asn ala asn asp gly thr arg ala val gly thr pro 
241/81 271/91 

ggc tat cag gec age gtc gac tat gtg gta aac aca ctg cgc aac age ggt ttt gat gtg 
gly tyr gin ala ser val asp tyr val val asn thr leu arg asn ser gly phe asp val 
301/101 331/111 

caa acc ccg gag ttc tec get cgc gtg ttc aag gee gaa aaa ggg gtg gtg acc etc ggc 
glri thr pro glu phe ser ala arg val phe lys ala glu lys gly val val thr leu gly 
361/121 391/131 

ggc aac acc gtg gag gcg agg gcg etc gag tac age etc ggc aca ccg ccg gac ggg gtg 
gly asn thr val glu ala arg ala leu glu tyr ser leu gly thr pro pro asp gly val 
421/141 451/151 

acg ggc ccg ctg gtg get gec ccc gee gac gac agt ccg ggc tgc agt ccg teg gac tac 
thr gly pro leu val ala ala pro ala asp asp ser pro gly cys ser pro ser asp tyr 
481/161 511/171 

gac agg ctg ccg gtg tec ggt gcg gtg gtg ctg gta gat cgc ggc gtc tgt cct ttt gee 
asp arg leu pro val ser gly ala val val leu val asp arg gly val cys pro phe ala 
541/181 571/191 

cag aag gaa gac gca gec gcg cag cgc ggt gcg gtg gcg ctg ate att get gac aac ate 
gin lys glu asp ala ala ala gin arg gly ala val ala leu ile ile ala asp asn ile 
601/201 631/211 

gac gag cag gcg atg ggc ggc acc ctg ggg get aat acc gac gtc aag ate ccg gtg gtg 
asp glu gin ala met gly gly thr leu gly ala asn thr asp val lys ile pro val val 
661/221 691/231 

agt gtc acc aag teg gtc gga ttc cag eta cgc gga cag tct ggg cca acc acc gtc aag 
ser val thr lys ser val gly phe gin leu arg gly gin ser gly pro thr thr val lys 
721/241 751/251 

etc acg gcg age acc caa agt ttc aag gec cgc aac gtc ate gcg cag acg aag acg ggg 
leu thr ala ser thr gin ser phe lys ala arg asn val ile ala gin thr lys thr gly 
781/261 811/271 

teg teg gec aac gtg gtg atg gca ggt gcg cat ttg gac age gtt ccg gaa gga ccc ggc 
ser ser ala asn val val met ala gly ala his leu asp ser val pro glu gly pro gly 
841/281 871/291 

ate aac gac aac ggc teg gga gtg get gcg gtt ctg gaa acg gca gtg cag ctg ggg aac 
ile asn asp asn gly ser gly val ala ala val leu glu thr ala val gin leu gly asn 
901/301 931/311 

tea ccg cat gtg tec aac gcg gta egg ttc gec ttc tgg ggc gee gag gaa ttc ggc ctg 
ser pro his val ser asn ala val arg phe ala phe trp gly ala glu glu phe gly leu 
961/321 991/331 

att ggg tea cga aac tac gtc gag teg ctg gac ate gac gcg etc aaa ggc ate gcg ctg 
ile gly ser arg asn tyr val glu ser leu asp ile asp ala leu lys gly ile ala leu 



SEQ ID NOS: 429-430 



FIG. 19D 



1021/341 1051/351 

tat ctg aac ttc gac atg ttg gcg teg ccg aac ccg ggt tac ttc acc tac gac ggt gac 
tyr leu asn phe asp met leu ala ser pro asn pro gly tyr phe thr tyr asp gly asp 
1081/361 1111/371 

cag teg ctg ccg eta gac gee cgc ggt cag ccg gtg gtg ccc gaa ggc teg gee ggt ate 
gin ser leu pro leu asp ala arg gly gin pro val val pro glu gly ser ala gly ile 
1141/381 1171/391 

gag cgc acg ttc gtc gee tat ctg aag atg gee ggc aag acc gcg cag gac acc teg ttc 
glu arg thr phe val ala tyr leu lys met ala gly lys thr ala gin asp thr ser phe 
1201/401 1231/411 

gac ggt egg tec gac tac gac ggc ttc acg ctg gcg ggt ate cct teg ggt ggc ctg ttc 
asp gly arg ser asp tyr asp gly phe thr leu ala gly ile pro ser gly gly leu phe 
1261/421 1291/431 

tec ggc get gag gtc aag aag tec gee gag caa gee gag etc tgg ggc ggc acc gec gac 
ser gly ala glu val lys lys ser ala glu gin ala glu leu trp gly gly thr ala asp 
1321/441 1351/451 

gag cct ttc gat ccc aac tat cac cag aag aca gac acc ctg gac cat ate gac cgc acc 
glu pro phe asp pro asn tyr his gin lys thr asp thr leu asp his ile asp arg thr 
1381/461 1411/471 

gcg etc ggt ate aac ggc get ggc gtc gcg tac gcg gtg ggt ttg tat gcg cag gac etc 
ala leu gly ile asn gly ala gly val ala tyr ala val gly leu tyr ala gin asp leu 
1441/481 1471/491 

ggc ggc ccc aac ggg gtt ccg gtc atg gcg gac cgc acc cgc cac ctg att gee aaa ccg 

gly gly pro asn gly val pro val met ala asp arg thr arg his leu ile ala lys pro) 

1501/501 

tga 

OPA 



SEQ ID NOS: 429-430 (continued) 

FIG. 19D (continued) 




ORF according to Cole et al . (Nature 393:537-544) and containing Rv0418 
1/^^-SEQ ID NOs 431 31/11 

tag gcc att caa cgc tct gtt cgt ttg att ggt egg tgg gat gcg aaa get gcg egg cga 

AMB (ala ile gin arg ser val arg leu ile gly arg trp asp ala lys ala ala arg arg 
61/21* SEQ ID NO: 432 91/31 

cag gcg egg tct aat ctg ggc gcg atg gtg aac aaa tec agg atg atg ccg gcg gtg ctg 

gin ala arg ser asn leu gly ala met val asn lys ser arg met met pro ala val leu 
121/41 151/51 

gcc gtg get gtg gtc gtc gca ttc ctg acg acg ggc tgt ate egg tgg tct acg cag teg 

ala val ala val val val ala phe leu thr thr gly cys ile arg trp ser thr gin ser 
181/61 211/71 

egg ccc gtt gtt aac ggc ccc get gcc gca gag ttc gcc gtt gcg ttg cgc aac egg gtg 

arg pro val val asn gly pro ala ala ala glu phe ala val ala leu arg asn arg val 
241/81 271/91 

age ace gac gcg atg atg gcg cac eta teg aaa ctg cag gac ate gcc aac gcc aac gac 

ser thr asp ala met met ala his leu ser lys leu gin asp ile ala asn ala asn asp 
301/101 331/111 

ggc act cgc gcg gtg ggc ace cct ggc tat cag gcc age gtc gac tat gtg gta aac aca 

gly thr arg ala val gly thr pro gly tyr gin ala ser val asp tyr val val asn thr 
361/121 391/131 

ctg cgc aac age ggt ttt gat gtg caa ace ccg gag ttc tec get cgc gtg ttc aag gcc 

leu arg asn ser gly phe asp val gin thr pro glu phe ser ala arg val phe lys ala 
421/141 451/151 

gaa aaa ggg gtg gtg ace etc ggc ggc aac ace gtg gag gcg agg gcg etc gag tac age 

glu lys gly val val thr leu gly gly asn thr val glu ala arg ala leu glu tyr ser 
481/161 511/171 

etc ggc aca ccg ccg gac ggg gtg acg ggc ccg ctg gtg get gcc ccc gcc gac gac agt 

leu gly thr pro pro asp gly val thr gly pro leu val ala ala pro ala asp asp ser 
541/181 571/191 

ccg ggc tgc agt ccg teg gac tac gac agg ctg ccg gtg tec ggt gcg gtg gtg ctg gta 

pro gly cys ser pro ser asp tyr asp arg leu pro val ser gly ala val val leu val 
601/201 631/211 

gat cgc ggc gtc tgt cct ttt gcc cag aag gaa gac gca gcc gcg cag cgc ggt gcg gtg 

asp arg gly val cys pro phe ala gin lys glu asp ala ala ala gin arg gly ala val 
661/221 691/231 

gcg ctg ate att get gac aac ate gac gag cag gcg atg ggc ggc ace ctg ggg get aat 

ala leu ile ile ala asp asn ile asp glu gin ala met gly gly thr leu gly ala asn 
721/241 751/251 

ace gac gtc aag ate ccg gtg gtg agt gtc ace aag teg gtc gga ttc cag eta cgc gga 

thr asp val lys ile pro val val ser val thr lys ser val gly phe gin leu arg gly 
781/261 811/271 

cag tct ggg cca ace ace gtc aag etc acg gcg age acc caa agt ttc aag gcc cgc aac 

gin ser gly pro thr thr val lys leu thr ala ser thr gin ser phe lys ala arg asn 
841/281 871/291 

gtc ate gcg cag acg aag acg ggg teg teg gcc aac gtg gtg atg gca ggt gcg cat ttg 

val ile ala gin thr lys thr gly ser ser ala asn val val met ala gly ala his leu 
901/301 931/311 

gac age gtt ccg gaa gga ccc ggc ate aac gac aac ggc teg gga gtg get gcg gtt ctg 

asp ser val pro glu gly pro gly ile asn asp asn gly ser gly val ala ala val leu 
961/321 991/331 

gaa acg gca gtg cag ctg ggg aac tea ccg cat gtg tec aac gcg gta egg ttc gcc ttc 

glu thr ala val gin leu gly asn ser pro his val ser asn ala val arg phe ala phe 

SEQ ID NOS: 431-432 



FIG. 19E 




1021/341 
















1051/351 
















L yy yy L gt,o, 
trp gly ala 
1081/361 


rr J5 <*r 

glu 


gaa 
glu 


\- \- r* 
LLL 

phe 


ggc 
gly 


c tg 
leu 


-a ■»- 4- 

aLL 

ile 


ggg 

gly 


LCa Cya aaC 

ser arg asn 
1111/371 


4- 

cac 
tyr 


gtc 
val 


gag 
glu 


teg 
ser 


ctg 
leu 


gac 
asp 


ate 
ile 


y aL y L.y l l l. 

asp ala leu 
1141/381 


aaa 
lys 


ggc 
gly 


d L L 

ile 


gcg 
ala 


ctg 
leu 


La. L 

tyr 


c tg 
leu 


aac ttc gac 
asn phe asp 
1171/391 


_v 4- ,~ 

aCy 

met 


ttg 
leu 


gcg 
ala 


teg 
ser 


ccg 
pro 


aac 
asn 


ccg 
pro 


yyt CaC LLC 

gly tyr phe 
1201/401 


acc 
thr 


*- t-+ 

LaC 

tyr 


gac 
asp 


ggt 
gly 


gac 
asp 


cag 
gin 


teg 
ser 


ctg ccg eta 
leu pro leu 
1231/411 


gac 
asp 


gee 
ala 


cgc 
arg 


ggt 
gly 


cag 
gin 


ccg 
pro 


gtg 

val 


gtg ccc gaa 
val pro glu 
1261/421 


ggc 
gly 


teg 
ser 


gee 
ala 


ggt 
gly 


ate 
ile 


gag 
glu 


cgc 
arg 


acg ttc gtc 
thr phe val 
1291/431 


gee 
ala 


tat 
tyr 


ctg 
leu 


aag 
lys 


atg 
met 


gee 
ala 


ggc 

gly 


aag acc gcg 
lys thr ala 
1321/441 


cag 
gin 


gac 
asp 


acc 
thr 


teg 
ser 


LLC 

phe 


gac 
asp 


ggt 
gly 


egg tec gac 
arg ser asp 
1351/451 


tac 
tyr 


gac 
asp 


ggc 
gly 


ttc 
phe 


acg 
thr 


ctg 
leu 


gcg 

ala 


yy l aLU ccl 
gly ile pro 
1381/461 


teg 
ser 


ggt 

gly 


ggc 
gly 


ctg 
leu 


4- 

LLC 

phe 


tec 
ser 


ggc 
gly 


get gag gtc 
ala glu val 
1411/471 


aag 
lys 


aag 
lys 


tec 
ser 


gee 
ala 


gag 
glu 


caa 
gin 


gee 
ala 


y ciy l. Lt- Lyy 
glu leu trp 

1441 / 4 ft 1 


ggc 
gly 


ggc 
gly 


acc 
thr 


gec 
ala 


gac 
asp 


gag 
glu 


ce t 
pro 


LLC yaL CCC 

phe asp pro 

1 471 //i Q1 


aac 
asn 


4~ 4~ 

LaL 

tyr 


cac 
his 


cag 
gin 


aag 
lys 


aca 
thr 


gac 
asp 


acc ctg gac 
thr leu asp 
1501/501 


cat 
his 


ate 
ile 


gac 
asp 


cgc 
arg 


acc 
thr 


gcg 
ala 


etc 
leu 


ggt ate aac 
gly ile asn 
1531/511 


ggc 

gly 


get 
ala 


ggc 
gly 


gtc 
val 


gcg 
ala 


tac 
tyr 


gcg 
ala 


gtg ggt ttg 
val gly leu 
1561/521 


tat 
tyr 


gcg 
ala 


cag 
gin 


gac 
asp 


etc 
leu 


ggc 

gly 


ggc 
gly 


ccc aac ggg 
pro asn gly 


gtt 

val 


ccg 
pro 


gtc 
val 


atg 
met 


gcg 
ala 


gac 
asp 


cgc 
arg 


acc cgc cac 
thr arg his 


ctg 
leu 


att 
ile 


gee 
ala 


aaa 
lys 


ccg tga 
pro) OPA 





















SEQ XD NOS: 431-432 (continued) 



FIG. 19E ( cont inued } 



^ SEQ ID NO: 433 31/11 

CGA GAC AGT GGT GCG GGA CAC TTG AGT TCG GCT GCT AAC GAC GCC AGA GTC GCC CGC TTC 

( arg as p ser gly ala gly his leu ser ser ala ala asn asp ala arg val ala arg phe 
61/21 — SEQ ID NO: 434 91/31 

CGC GGT GTG GGA CTC ACG TTC GGT GAG GGT ACA GCG GAC CTT CGA GCA CGC AAT ATC GTG 

arg gly val gly leu thr phe gly glu gly thr ala asp leu arg ala arg asn ile val 
121/41 151/51 

GGC CGG CTG GCA ACC GTC GGT TTC GAC GTT GGT GAC GAC CCC TCG TTC ATG AAT CGT TCT 

gly arg leu ala thr val gly phe asp val gly asp asp pro ser phe met asn arg ser) 
181/61 211/71 

TGA GCT CCC CGT TTT GCT GGA TGC CCA GGC ACC GCC GGT ACT GCT GCG CTT AAG CTT GTC 

OPA (ala pro arg phe ala gly cys pro gly thr ala gly thr ala ala leu lys leu val 
241/81 SEQ ID NO: 435 271/91 

GCA CAT GGT GCC GGC AGG GAG GAA CAG TGG GCA AGC AGC TAG CCG CGC TCG CCG CGC TGG 

ala his gly ala gly arg glu glu gin trp ala ser ser)AMB(pro arg ser pro arg trp 

301/101 331/111 ^SEQ ID NO: 436 

TCG GTG CGT GCA TGC TCG CAG CCG GAT GCA CCA ACG TGG TCG ACG GGA CCG CCG TGG CTG 

ser val arg ala cys ser gin pro asp ala pro thr trp ser thr gly pro pro trp leu 

361/121 

CCG ACA AAT CCG GAC CAC TGC ATC AGG ATC 

pro thr asn pro asp his cys ile arg ile) 

SEQ XD NOS:433-436 



FIG. 20A 



VM 2 6 2KB 81 




p *SEQ 


ID NO: 437 


SEQ ID NO: 


439 


32/11 


















GAG ACA 


GTG 


GTG 


CGG 


GAC 


ACT 


TGA\ 


GTT 


CGG 


CTG CTA 


ACG 


ACG 


CCA 


GAG 


TCG 


CCC 


GCT 


TCC 


(glu thr 


val 


val 


arg 


asp 


thr)OPA(val 


arg 


leu leu 


thr 


thr 


pro 


glu 


ser 


pro 


ala 


ser 


62/21 ^ 


SEQ 


ID NO: 438 










92/31 


















GCG GTG 


TGG 


GAC 


TCA 


CGT 


TCG 


GTG 


AGG 


GTA 


CAG CGG 


ACC 


TTC 


GAG 


CAC 


GCA 


ATA 


TCG 


TGG 


ala val 


trp 


asp 


ser 


arg 


ser 


val 


arg 


val 


gin arg 


thr 


phe 


glu 


his 


ala 


ile 


ser 


trp 


122/41 


















152/51 


















GCC GGC 


TGG 


CAA 


CCG 


TCG 


GTT 


TCG 


ACG 


TTG 


GTG ACG 


ACC 


CCT 


CGT 


TCA 


TGA 


ATC 


GTT 


CTT 


ala gly 


trp 


gin 


pro 


ser 


val 


ser 


thr 


leu 


val thr 


thr 


pro 


arg 


ser ) OPA (ile 


val 


leu 


182/61 


















212/71 


SEQ ID NO: 


440 * 








GAG CTC 


ccc 


GTT 


TTG 


CTG 


GAT 


GCC 


CAG 


GCA 


CCG CCG 


GTA 


CTG 


CTG 


CGC 


TTA 


AGC 


TTG 


TCG 


glu leu 


pro 


val 


leu 


leu 


asp 


ala 


gin 


ala 


pro pro 


val 


leu 


leu 


arg 


leu 


ser 


leu 


ser 


242/81 


















272/91 


















CAC ATG 


GTG 


CCG 


GCA 


GGG 


AGG 


AAC 


AGT 


GGG 


CAA GCA 


GCT 


AGC 


CGC 


GCT 


CGC 


CGC 


GCT 


GGT 


his met 


val 


pro 


ala 


gly 


arg 


asn 


ser 


gly 


gin ala 


ala 


ser 


arg 


ala 


arg 


arg 


ala 


gly 


302/101 


















332/111 


















CGG TGC 


GTG 


CAT 


GCT 


CGC 


AGC 


CGG 


ATG 


CAC 


CAA CGT 


GGT 


CGA 


CGG 


GAC 


CGC 


CGT 


GGC 


TGC 


arg cys 


val 


his 


ala 


arg 


ser 


arg 


met 


his 


gin arg 


gly 


arg 


arg 


asp 


arg 


arg 


gly 


cys 


362/121 




































CGA CAA 


ATC 


CGG 


ACC 


ACT 


GCA 


TCA 


GGA 


TC 




















arg gin 


ile 


arg 


thr 


thr 


ala 


ser 


gly) 























SEQ ID NOSs437-440 

FIG. 20B 



^ SEQ 


ID NO: 441 












33/11 


















AGA CAG 


TGG 


TGC 


GGG 


ACA 


CTT 


GAG 


TTC 


GGC 


TGC TAA 


CGA 


CGC 


CAG 


AGT 


CGC 


CCG 


CTT 


CCG 


(arg gin 


trp 


cys 


gly 


thr 


leu 


glu 


phe 


gly 


cys)OCH(arg 


arg 


gin 


ser 


arg 


pro 


leu 


pro 


63/2f^ 


SEQ 


ID NO: 442 










93/31 


SEQ ID NO: 443 








CGG TGT 


GGG 


ACT 


CAC 


GTT 


CGG 


TGA 


GGG 


TAC 


AGC GGA 


CCT 


TCG 


AGC 


ACG 


CAA 


TAT 


CGT 


GGG 


arg cys 


gly 


thr 


his 


val 


arg)OPA(gly 


tyr 


ser gly 


pro 


ser 


ser 


thr 


gin 


tyr 


arg 


gly 


123/41 






SEQ 


ID NO: 444 




153/51 


















CCG GCT 


GGC 


AAC 


CGT 


CGG 


TTT 


CGA 


CGT 


TGG 


TGA CGA 


CCC 


CTC 


GTT 


CAT 


GAA 


TCG 


TTC 


TTG 


pro ala 


gly 


asn 


arg 


arg 


phe 


arg 


arg 


trp) OPA (arg 


pro 


leu 


val 


his 


glu 


ser 


phe 


leu 


183/61 


















213/71 




- SEQ 


ID NO: 445 








AGC TCC 


CCG 


TTT 


TGC 


TGG 


ATG 


CCC 


AGG 


CAC 


CGC CGG 


TAC 


TGC 


TGC 


GCT 


TAA 


GCT 


TGT 


CGC 


ser ser 


pro 


phe 


cys 


trp 


met 


pro 


arg 


his 


arg arg 


tyr 


cys 


cys 


ala)OCH(ala 


cys 


arg 


243/81 


















273/91 


SEQ ID NO: 44 6 — 








ACA TGG 


TGC 


CGG 


CAG 


GGA 


GGA 


ACA 


GTG 


GGC 


AAG CAG 


CTA 


GCC 


GCG 


CTC 


GCC 


GCG 


CTG 


GTC 


thr trp 


cys 


arg 


gin 


gly 


gly 


thr 


val 


gly 


lys gin 


leu 


ala 


ala 


leu 


ala 


ala 


leu 


val 


303/101 


















333/111 


















GGT GCG 


TGC 


ATG 


CTC 


GCA 


GCC 


GGA 


TGC 


ACC 


AAC GTG 


GTC 


GAC 


GGG 


ACC 


GCC 


GTG 


GCT 


GCC 


gly ala 


cys 


met 


leu 


ala 


ala 


gly 


cys 


thr 


asn val 


val 


asp 


gly 


thr 


ala 


val 


ala 


ala 


363/121 




































GAQ • AAA 


TCC 


GGA 


CCA 


.CTG 


CAT 


CAG 


GAT 


C 




















asp lys 


ser 


gly 


pro 


leu 


his 


gin 


asp) 























SEQ ID NOS: 441-446 

FIG. 20C 




part of the nucleotide sequence of seq2 0A 
1/1^- SEQ ID NO: 447 SEQ ID NO: 449 31/11 



TGT 


GGG 


ACT 


CAC 


GTT 


CGG 


TGAL 


GGG 


TAC 


AGC 


GGA CCT 


TCG 


AGC 


ACG 


CAA 


TAT 


CGT 


GGG 


CCG 


(cys 


gly 


thr 


his 


val 


arg)OPA(gly 


tyr 


ser 


gly pro 


ser 


ser 


thr 


gin 


tyr 


arg 


gly 


pro 


61/21^- 


• SEQ 


ID NO: 448 










91/31 


















GCT 


GGC 


AAC 


/-> rp 




TTT 


CGA 


CGT 


TGG 


TGA 


CGA CCC 


CTC 


GTT 


CAT 


GAA 


TCG 


TTC 


TTG 


AGC 


ala 


gly 


asn 


arg 


arg 


phe 


arg 


arg 


trp)OPA(arg pro 


leu 


val 


his 


glu 


ser 


phe 


leu 


ser 


121/41 








SEQ ID 


NO: 


450- 




151/51 


















TCC 


CCG 


TTT 


TGC 


TGG 


ATG 


CCC 


AGG 


CAC 


CGC 


CGG TAC 


TGC 


TGC 


GCT 


TAA 


GCT 


TGT 


CGC 


ACA 


ser 


pro 


phe 


cys 


trp 


met 


pro 


arg 


his 


arg 


arg tyr 


cys 


cys 


ala)OCH(ala 


cys 


arg 


thr 


181/61 


















211/71 


SEQ 


ID NO: 451-^ 








TGG 


TGC 


CGG 


CAG 


GGA 


GGA 


ACA 


GTG 


GGC 


AAG 


CAG CTA 


GCC 


GCG 


CTC 


GCC 


GCG 


CTG 


GTC 


GGT 


trp 


cys 


arg 


gin 


gly 


gly 


thr 


val 


gly 


lys 


gin leu 


ala 


ala 


leu 


ala 


ala 


leu 


val 


gly 


241/81 


















271/91 


















GCG 


TGC 


ATG 


CTC 


GCA 


GCC 


GGA 


TGC 


ACC 


AAC 


GTG GTC 


GAC 


GGG 


ACC 


GCC 


GTG 


GCT 


GCC 


GAC 


ala 


cys 


met 


leu 


ala 


ala 


gly 


cys 


thr 


asn 


val val 


asp 


gly 


thr 


ala 


val 


ala 


ala 


asp 


301/101 




































AAA 


TCC 


GGA 


CCA 


CTG 


CAT 


CAG 


GAT 


C 






















lys 


ser 


gly 


pro 


leu 


his 


gin 


asp) 






































SEQ ZD NOS:447- 


451 
































FIG. 


20A' 




















^SEQ ID 


NO: 


452 












31/11 


















dTG 


GGA 


CTC 


ACG 


TTC 


GGT 


GAG 


GGT 


ACA 


GCG 


GAC CTT 


CGA 


GCA 


CGC 


AAT 


ATC 


GTG 


GGC 


CGG 


(val 


gly 


leu 


thr 


phe 


gly 


glu 


gly 


thr 


ala 


asp leu 


arg 


ala 


arg 


asn 


ile 


val 


gly 


arg 


61/21 


-SEQ 


ID NO: 453 










91/31 


















CTG 


GCA 


ACC 


GTC 


GGT 


TTC 


GAC 


GTT 


GGT 


GAC 


GAC CCC 


TCG 


TTC 


ATG 


AAT 


CGT 


TCT 


TGA 


GCT 


leu 


ala 


thr 


val 


gly 


phe 


asp 


val 


gly 


asp 


asp pro 


ser 


phe 


met 


asn 


arg 


ser )OPA(ala 


121/41 


















151/51 




SEQ 


ID NO: 454 - 








CCC 


CGT 


TTT 


GCT 


GGA 


TGC 


CCA 


GGC 


ACC 


GCC 


GGT ACT 


GCT 


GCG 


CTT 


AAG 


CTT 
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GCA 


CAT 


pro 
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ala 
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cys 


pro 


gly 
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ala 
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ala 


ala 
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leu 
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ala 


his 


181/61 


















211/71 
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— SEQ ID NO: 455 
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pro 
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pro 


thr 
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SEQ ID NOS: 452-455 
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i/l^- — SEQ ID NO: 456 31/11 

GTG TGG GAC TCA CGT TCG GTG AGG GTA CAG CGG ACC TTC GAG CAC GCA ATA TCG TGG GCC 
(val tr p asp ser arg ser val arg val gin arg thr phe glu his ala ile ser trp ala 
61/21 SEQ ID NO: 457 91/31 

GGC TGG CAA CCG TCG GTT TCG ACG TTG GTG ACG ACC CCT CGT TCA TGA ATC GTT CTT GAG 
gly trp gin pro ser val ser thr leu val thr thr pro arg ser)OPA(ile val leu glu 
121/41 151/51 SEQ ID NO: 458-" 

CTC CCC GTT TTG CTG GAT GCC CAG GCA CCG CCG GTA CTG CTG CGC TTA AGC TTG TCG CAC 
leu pro val leu leu asp ala gin ala pro pro val leu leu arg leu ser leu ser his 
181/61 211/71 

ATG GTG CCG GCA GGG AGG AAC AGT GGG CAA GCA GCT AGC CGC GCT CGC CGC GCT GGT CGG 
met val pro ala gly arg asn ser gly gin ala ala ser arg ala arg arg ala gly arg 
241/81 271/91 

TGC GTG CAT GCT CGC AGC CGG ATG CAC CAA CGT GGT CGA CGG GAC CGC CGT GGC TGC CGA 
cys val his ala arg ser arg met his gin arg gly arg arg asp arg arg gly cys arg 
301/101 

CAA ATC CGG ACC ACT GCA TCA GGA TC 
gin ile arg thr thr ala ser gly) 

SEQ XD NOS: 456-458 

FIG. 200' 

sequence Rv3576 predicted by Cole et al . (Nature 393:537-544) and containing 
seq20A' 

lj^l SEQ ID NO: 459 31/11 

atg ggc aag cag eta gec gcg etc gec gcg ctg gtc ggt gcg tgc atg etc gca gec gga 
( met gly lys gin leu ala ala leu ala ala leu val gly ala cys met leu ala ala gly 
6*1721 * SEQ ID NO: 460 91/31 

tgc acc aac gtg gtc gac ggg acc gec gtg get gee gac aaa tec gga cca ctg cat cag 
cys thr asn val val asp gly thr ala val ala ala asp lys ser gly pro leu his gin 
121/41 151/51 

gat ccg ata ccg gtt tea gcg ctt gaa ggg ctg ctt etc gac ttg age cag ate aat gee 
asp pro ile pro val ser ala leu glu gly leu leu leu asp leu ser gin ile asn ala 
181/61 211/71 

gcg ctg ggt gcg aca teg atg aag gtg tgg ttc aac gec aag gca atg tgg gac tgg age 
ala leu gly ala thr ser met lys val trp phe asn ala lys ala met trp asp trp ser 
241/81 271/91 

aag age gtg gee gac aag aat tgc ctg get ate gac ggt cca gca cag gaa aag gtc tat 
lys ser val ala asp lys asn cys leu ala ile asp gly pro ala gin glu lys val tyr 
301/101 331/111 

gee ggc acc ggg tgg acc get atg cgc ggc caa egg ctg gat gac age ate gat gac tec 
ala gly thr gly trp thr ala met arg gly gin arg leu asp asp ser ile asp asp ser 
361/121 391/131 

aag aaa cgc gac cac tac gee att caa gcg gtc gtc ggc ttc ccg acc gca cat gat gee 
lys lys arg asp his tyr ala ile gin ala val val gly phe pro thr ala his asp ala 
421/141 451/151 

gag gag ttc tac age tec teg gtg caa age tgg age age tgc teg aac cgc egg ttt gtc 
glu glu phe tyr ser ser ser val gin ser trp ser ser cys ser asn arg arg phe val 
481/161 511/171 

gaa gtc acc ccc gga cag gac gac gee gee tgg act gtg get gac gtt gtc aac gac aac 
glu val thr pro gly gin asp asp ala ala trp thr val ala asp val val asn asp asn 
541/181 571/191 

ggc atg etc agt age teg cag gtt cag gaa ggc ggc gac gga tgg acc tgc cag cgt gec 
gly met leu ser ser ser gin val gin glu gly gly asp gly trp thr cys gin arg ala 
601/201 631/211 

ctg act gcg cgc aac aac gtc act ate gac att gtc acg tgc gee tat age caa ccg gat 
leu thr ala arg asn asn val thr ile asp ile val thr cys ala tyr ser gin pro asp 
661/221 691/231 

ttg gtg gcg att ggc ate get aac caa ate gcg gee aag gtt get aag cag tag 
leu val ala ile gly ile ala asn gin ile ala ala lys val ala lys gin) AMB 

SEQ XD NOS : 459-460 



FIG. 20D 




ORF according to Cole et al . {Nature 393:537-544) and containing Rv3576 
1/1^ SEQ ID NOs 461 31/11 

taa get tgt cgc aca tgg tgc egg cag gga gga aca gtg ggc aag cag eta gee gcg etc 
OCH(ala cys arg thr trp cys arg gin gly gly thr val gly lys gin leu ala ala leu 
61/21"" SEQ ID NO: 462 91/31 

gee gcg ctg gtc ggt gcg tgc atg etc gca gee gga tgc ace aac gtg gtc gac ggg acc 
ala ala leu val gly ala cys met leu ala ala gly cys thr asn val val asp gly thr 
121/41 151/51 

gee gtg get gee gac aaa tec gga cca ctg cat cag gat ccg ata ccg gtt tea gcg ctt 
ala val ala ala asp lys ser gly pro leu his gin asp pro ile pro val ser ala leu 
181/61 211/71 

gaa ggg ctg ctt etc gac ttg age cag ate aat gee gcg ctg ggt gcg aca teg atg aag 
glu gly leu leu leu asp leu ser gin ile asn ala ala leu gly ala thr ser met lys 
241/81 271/91 

gtg tgg ttc aac gee aag gca atg tgg gac tgg age aag age gtg gee gac aag aat tgc 
val trp phe asn ala lys ala met trp asp trp ser lys ser val ala asp lys asn cys 
301/101 331/111 

ctg get ate gac ggt cca gca cag gaa aag gtc tat gee ggc acc ggg tgg acc get atg 
leu ala ile asp gly pro ala gin glu lys val tyr ala gly thr gly trp thr ala met 
361/121 391/131 

cgc ggc caa egg ctg gat gac age ate gat gac tec aag aaa cgc gac cac tac gee att 
arg gly gin arg leu asp asp ser ile asp asp ser lys lys arg asp his tyr ala ile 
421/141 451/151 

caa gcg gtc gtc ggc ttc ccg acc gca cat gat gee gag gag ttc tac age tec teg gtg 
gin ala val val gly phe pro thr ala his asp ala glu glu phe tyr ser ser ser val 
481/161 511/171 

caa age tgg age age tgc teg aac cgc egg ttt gtc gaa gtc acc ccc gga cag gac gac 
gin ser trp ser ser cys ser asn arg arg phe val glu val thr pro gly gin asp asp 
541/181 571/191 

gee gee tgg act gtg get gac gtt gtc aac gac aac ggc atg etc agt age teg cag gtt 
ala ala trp thr val ala asp val val asn asp asn gly met leu ser ser ser gin val 
601/201 631/211 

cag gaa ggc gge gac gga tgg acc tgc cag cgt gee ctg act gcg cgc aac aac gtc act 
gin glu gly gly asp gly trp thr cys gin arg ala leu thr ala arg asn asn val thr 
661/221 691/231 

ate gac att gtc acg tgc gee tat age caa ccg gat ttg gtg gcg att ggc ate get aac 
ile asp ile val thr cys ala tyr ser gin pro asp leu val ala ile gly ile ala asn 
721/241 

caa ate gcg gee aag gtt get aag cag tag 
gin ile ala ala lys val ala lys gin) AMB 



SEQ ID NOS: 461-462 



FIG. 20E 



1/ 1 — SEQ ID NO: 463 31/11 

GTC CTG GTC GCC GCG CAA CTG GCC GGT CCC GAT GGA AAG TGT TCA CGA TCG CGC TTC TGC 

(valleu val ala ala gin leu ala gly pro asp gly lys cys ser arg ser arg phe cys 
61/2 SEQ ID NO: 464 SEQ ID NO: 466 91/31 

CGC TGG TAG TGG CGA TGG TGT TAG CAG GAT TGC GGG TCG AGG CTG CGA TGG CCA GCA CCA 

arg trp)AMB(trp arg trp cys)AMB(gln asp cys gly ser arg leu arg trp pro ala pro 
121/41 ^SEQ ID NO: 465 151/51 

GCG GCC TGC GGC TGG TCG CCG CGC GCG CCG AAA TGA TAC CCG CGA TCA CGA AAT ACA TGT 

ala ala cys gly trp ser pro arg ala pro lys)OPA(tyr pro arg ser arg asn thr cys 



181/61 211/71 — SEQ ID NO: 467 

CGG CGC TGG ACG TCG CCG TGC TGG CCA GCT CGA CCG GAC ACG ATG TGG AGG GGG CGC AGA 
arg arg trp thr ser pro cys trp pro ala arg pro asp thr met trp arg gly arg arg 
241/81 > 271/91 

AAA ACT TCA CCG CCC GCA AGT ACG AGC TGC AGA CGC GAC TGG CCG ACA CCG ACG TCA TCG 
lys thr ser pro pro ala ser thr ser cys arg arg asp trp pro thr pro thr ser ser 
301/101 331/111 

CAG ACG TGC GGT CGG GAG TGA ACA CGC TGC TCA ACG GCG GTC AGG CGC TGC TGG ATA AGA 
gin thr cys gly arg glu)OPA(thr arg cys ser thr ala val arg arg cys trp ile arg 
361/121 ^ SEQ ID NO: 468 

TGC TGG CCG ACA GCA TCG GCT TGC GGG ATC 
cys trp pro thr ala ser ala cys gly ile) 



SEQ ID NOSs 463-468 

FIG. 21 A 



^ -SEQ ID NO: 469 32/11 

TCC TGG TCG CCG CGC AAC TGG CCG GTC CCG ATG GAA AGT GTT CAC GAT CGC GCT TCT GCC 
( ser t rp ser pro arg asn trp pro val pro met glu ser val his asp arg ala ser ala 
62/21"--- SEQ ID NO: 470 92/31 

GCT GGT AGT GGC GAT GGT GTT AGC AGG ATT GCG GGT CGA GGC TGC GAT GGC CAG CAC CAG 
ala gly ser gly asp gly val ser arg ile ala gly arg gly cys asp gly gin his gin 
122/41 152/51 

CGG CCT GCG GCT GGT CGC CGC GCG CGC CGA AAT GAT ACC CGC GAT CAC GAA ATA CAT GTC 
arg pro ala ala gly arg arg ala arg arg asn asp thr arg asp his glu ile his val 
182/61 ■ 212/71 

GGC GCT GGA CGT CGC ■ CGT GCT GGC CAG CTC GAC CGG ACA CGA TGT GGA GGG GGC GCA GAA 
gly ala gly arg arg arg ala gly gin leu asp arg thr arg cys gly gly gly ala glu 
242/81 272/91 

AAA CTT CAC CGC CCG CAA GTA CGA GCT GCA GAC GCG ACT GGC CGA CAC CGA CGT CAT CGC 
lys leu his arg pro gin val arg ala ala asp ala thr gly arg his arg arg his arg 
302/101 332/111 

AGA CGT GCG GTC GGG AGT GAA CAC GCT GCT CAA CGG CGG TCA GGC GCT GCT GGA TAA GAT 
arg arg ala val gly ser glu his ala ala gin arg arg ser gly ala ala gly)OCHJ[asp 
362/121 SEQ ID NO: 471- 

,.JGCT ;GGQ^ GGA TC _ 

.. ala,,giy:. arg J gin .his . arg leu ala gly) \ . .." ' \ 

SEQ ID NOS:469-471 
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SEQ ID NO: 472 33/11 

CCT. GGT CGC CGC GCA ACT GGC CGG TCC CGA TGG AAA GTG TTC ACG ATC GCG CTT CTG.CCG 

( pro gl y arg arg ala thr gly arg ser arg trp lys val phe thr ile ala leu leu pro 
63/21^-SEEQ ID NO: 473 93/31 

CTG GTA GTG GCG ATG GTG TTA GCA GGA TTG CGG GTC GAG GCT GCG ATG GCC AGC ACC AGC 

leu val val ala met val leu ala gly leu arg val glu ala ala met ala ser thr ser 
123/41 153/51 

GGCJ CTG CGG CTG GTC GCC GCG CGC GCC GAA ATG ATA CCC GCG ATC ACG AAA TAC ATG TCG 

glU leu arg leu val ala ala arg ala glu met ile pro ala ile thr lys tyr met ser 
183V61 213/71 

GCG CTG GAC GTC GCC GTG CTG GCC AGC TCG ACC GGA CAC GAT GTG GAG GGG GCG CAG AAA 

ala leu asp val ala val leu ala ser ser thr gly his asp val glu gly ala gin lys 
243/81 273/91 

AAC TTC ACC GCC CGC AAG TAC GAG CTG CAG ACG CGA CTG GCC GAC ACC GAC GTC ATC GCA 

asn phe thr ala arg lys tyr glu leu gin thr arg leu ala asp thr asp val ile ala 
303/101 333/111 

GAC GTG CGG TCG GGA GTG AAC ACG CTG CTC AAC GGC GGT CAG GCG CTG CTG GAT AAG ATG 

asp val arg ser gly val asn thr leu leu asn gly gly gin ala leu leu asp lys met 
363/121 

CTG GCC GAC AGC ATC GGC TTG CGG GAT C 

leu ala asp ser ile gly leu arg asp) 

SEQ ID NOS: 472-473 

FIG. 21 C 



part of the nucleotide sequence of seq21A 



1/1 ^SEQ ID NO: 


474 










31/11 
















ACG ATC 


GCG 


CTT 
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CTG 


GTA 


GTG 
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ATG GTG 


TTA 
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TTG 


CGG 
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pro 
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ala 


met val 


leu 


ala 


gly 


leu 


arg 


val 


glu 


61/21^- 


SEQ 


ZD NO: 475 










91/31 
















GCG ATG 


GCC 


AGC 


ACC 


AGC 


GGC 


CTG 


CGG 


CTG 


GTC GCC 


GCG 


CGC 


GCC 


GAA 


ATG 


ATA 


CCC 


ala met 


ala 


ser 


thr 


ser 


gly 


leu 


arg 


leu 


val ala 


ala 


arg 


ala 


glu 


met 


ile 


pro 
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151/51 
















ATC ACG 


AAA 


TAC 


ATG 


TCG 


GCG 


CTG 


GAC 


GTC 


GCC GTG 


CTG 


GCC 


AGC 


TCG 


ACC 


GGA 


CAC 


ile thr 


lys 


tyr 


met 


ser 


ala 


leu 


asp 


val 


ala val 


leu 


ala 


ser 


ser 


thr 


gly 


his 
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211/71 
















GTG GAG 


GGG 


GCG 


CAG 


AAA 


AAC 


TTC 


ACC 


GCC 


CGC AAG 


TAC 


GAG 


CTG 


CAG 


ACG 


CGA 


CTG 


val glu 


gly 


ala 


gin 


lys 


asn 


phe 


thr 


ala 


arg lys 


tyr 


glu 


leu 


gin 


thr 


arg 


leu 


241/81 


















271/91 
















GAC ACC 


GAC 


GTC 


ATC 


GCA 


GAC 


GTG 


CGG 


TCG 


GGA GTG 


AAC 


ACG 


CTG 


CTC 


AAC 


GGC 


GGT 


asp thr 


asp 


val 


ile 


ala 


asp 


val 


arg 


ser 


gly val 


asn 


thr 


leu 


leu 


asn 


gly 


gly 


301/101 


















331/111 
















GCG CTG 


CTG 


GAT 


AAG 


ATG 


CTG 


GCC 


GAC 


AGC 


ATC GGC 


TTG 


CGG 


GAT 


C 








ala leu 


leu 


asp 
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met 


leu 


ala 


asp 


ser 


ile gly 


leu 


arg 


asp) 











SEQ ID NOS: 474-475 
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1/1 — SEQ ID NO: 476 SEQ ID NO: 478 31/11 

CGA TCG CGC TTC TGC CGC TGG TAG ^ TGG CGA TGG TGT TAG CAG GAT TGC GGG TCG AGG CTG 
( arg se r arg phe cys arg trp)AMB(trp arg trp cys)AMB(gln asp cys gly ser arg leu 

61/ 21"^- SEQ ID NO: 477 91/31 ' v — - SEQ ID NO: 479 

CGA TGG CCA GCA CCA GCG GCC TGC GGC TGG TCG CCG CGC GCG CCG AAA TGA TAC CCG CGA 

arg trp pro ala pro ala ala cys gly trp ser pro arg ala pro lys)OPA(tyr pro arg 

121/41 151/51 SEQ ID NO: 480—*' 

TCA CGA AAT ACA TGT CGG CGC TGG ACG TCG CCG TGC TGG CCA GCT CGA CCG GAC ACG ATG 

ser arg asn thr cys arg arg trp thr ser pro cys trp pro ala arg pro asp thr met 

181/61 211/71 

TGG AGG GGG CGC AGA AAA ACT TCA CCG CCC GCA AGT ACG AGC TGC AGA CGC GAC TGG CCG 

trp arg gly arg arg lys thr ser pro pro ala ser thr ser cys arg arg asp trp pro 

241/81 271/91 

ACA CCG ACG TCA TCG CAG ACG TGC GGT CGG GAG TGA ACA CGC TGC TCA ACG GCG GTC AGG 

thr pro thr ser ser gin thr cys gly arg giu)OPA(thr arg cys ser thr ala val arg 

301/101 331/111 ^ — SEQ ID NO: 481 

CGC TGC TGG ATA AGA TGC TGG CCG ACA GCA TCG GCT TGC GGG ATC 

arg cys trp ile arg cys trp pro thr ala ser ala cys gly ile) 

SEQ ID NOS:476-481 
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1/1_— SEQ ID NO: 482 31/11 

CAC GAT CGC GCT TCT GCC GCT GGT AGT GGC GAT GGT GTT AGC AGG ATT GCG GGT CGA GGC 
( his as p arg ala ser ala ala gly ser gly asp gly val ser arg ile ala gly arg gly 
61/21 ^SEQ ID NO: 483 91/31 

TGC GAT GGC CAG CAC CAG CGG CCT GCG GCT GGT CGC CGC GCG CGC CGA AAT GAT ACC CGC 
cys asp gly gin his gin arg pro ala ala gly arg arg ala arg arg asn asp thr arg 
121/41 151/51 

GAT CAC GAA ATA CAT GTC GGC GCT GGA CGT CGC CGT GCT GGC CAG CTC GAC CGG ACA CGA 
asp his glu ile his val gly ala gly arg arg arg ala gly gin leu asp arg thr arg 
181/61 211/71 

TGT GGA GGG GGC GCA GAA AAA CTT CAC CGC CCG CAA GTA CGA GCT GCA GAC GCG ACT GGC 
cys gly gly gly ala glu lys leu his arg pro gin val arg ala ala asp ala thr gly 
241/81 271/91 

CGA CAC CGA CGT CAT CGC AGA CGT GCG GTC GGG AGT GAA CAC GCT GCT CAA CGG CGG TCA 
arg his arg arg his arg arg arg ala val gly ser glu his ala ala gin arg arg ser 
301/101 331/111 

GGC GCT GCT GGA TAA GAT GCT GGC CGA CAG CAT CGG CTT GCG GGA TC 
gly ala ala gly)OCH(asp ala gly arg gin his arg leu ala gly) 

^ — SEQ ID NO: 484 



SEQ ID NOS: 482-484 
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sequence Rv3365c predicted by Cole et al . (Nature 393:537-544) and 
containing Seq2 1A ■ 

1/1__^-SEQ ID NO: 485 31/11 

gtg acc atg ttc gcc cgc ccg acc ate ccg gtc gcg gcg gec get tct gat att tec gec 
( val th r met phe ala arg pro thr ile pro val ala ala ala ala ser asp ile ser ala 
61/21^" SEQ ID NO: 486 91/31 

ccg get caa ccg gcc cgc ggc aaa cct cag caa cgc ccg ccg tec tgg teg ccg cgc aac 
pro ala gin pro ala arg gly lys pro gin gin arg pro pro ser trp ser pro arg asn 
121/41 151/51 

tgg ccg gtc cga tgg aaa gtg ttc acg ate gcg ctt ctg ccg ctg gta gtg gcg atg gtg 
trp pro val arg trp lys val phe thr ile ala leu leu pro leu val val ala met val 
181/61 211/71 

tta gca gga ttg egg gtc gag get gcg atg gcc age acc age ggc ctg egg ctg gtc gcc 
leu ala gly leu arg val glu ala ala met ala ser thr ser gly leu arg leu val ala 
241/81 271/91 

gcg cgc gcc gaa atg ata ccc gcg ate acg aaa tac atg teg gcg ctg gac gtc gcc gtg 
ala arg ala glu met ile pro ala ile thr lys tyr met ser ala leu asp val ala val 
301/101 331/111 

ctg gcc age teg acc gga cac gat gtg gag ggg gcg cag aaa aac ttc acc gcc cgc aag 
leu ala ser ser thr gly his asp val glu gly ala gin lys asn phe thr ala arg lys 
361/121 391/131 

tac gag ctg cag acg cga ctg gcc gac acc gac gtc ate gca gac gtg egg teg gga gtg 
tyr glu leu gin thr arg leu ala asp thr asp val ile ala asp val arg ser gly val 
421/141 451/151 

aac acg ctg etc aac ggc ggt cag gcg ctg ctg gat aag gtg ctg gcc gac age ate ggc 
asn thr leu leu asn gly gly gin ala leu leu asp lys val leu ala asp ser ile gly 
481/161 511/171 

ttg egg gat egg gtc acc gcc tac gcg ccg ctg ctg ttg acg gcc cag aac gtg att gac 
leu arg asp arg val thr ala tyr ala pro leu leu leu thr ala gin asn val ile asp 
541/181 571/191 

gcg teg gtg egg gtt gac age gag caa ate cga acc cag gtg cag ggt ttg age cga gcc 
ala ser val arg val asp ser glu gin ile arg thr gin val gin gly leu ser arg ala 
601/201 631/211 

gtt ggc gcc cgc ggg cag atg acg atg cag gag ate ctg gtg act cgc ggc gcc gac ctt 
val gly ala arg gly gin met thr met gin glu ile leu val thr arg gly ala asp leu 
661/221 691/231 

gcc gag ccg caa ctg cgc age gcg atg gtt acc ctg gcc ggc acc gaa ccc teg acg ctg 
ala glu pro gin leu arg ser ala met val thr leu ala gly thr glu pro ser thr leu 
721/241 751/251 

ttc ggg atg age gcg gcg etc ggt gca ggc teg ccg gac acc aag aac ctg cag cag caa 
phe gly met ser ala ala leu gly ala gly ser pro asp thr lys asn leu gin gin gin 
781/261 811/271 

atg gtg acc agg atg gcg ate atg tec gat ccg gcc gtt gca ctg gtc aac aac cca gag 
met val thr arg met ala ile met ser asp pro ala val ala leu val asn asn pro glu 
841/281 871/291 

ctg ctg cac teg ata cag ate acc cgc gac att gcc gag cag gtg ate acc gac acc acc 
leu leu his ser ile gin ile thr arg asp ile ala glu gin val ile thr asp thr thr 
901/301 931/311 

gag gcg gtg acg aag teg gtg caa age cag gcc acc gac egg egg gat gcc gcg att cgc 
glu ala val thr lys ser val gin ser gin ala thr asp arg arg asp ala ala ile arg 
961/321 991/331 

gac gcc gtg ctg gtg ttg gcc gcc ate gcg acc gcg ate gtc gtc gtg ttg gtg gtg gcg 
asp ala val leu val leu ala ala ile ala thr ala ile val val val leu val val ala 



SEQ JD NOS: 485-486 



FIG. 21 D 




1021/341 

cgc acg ctg gtc ggg ccg atg egg gta 
arg thr leu val gly pro met arg val 
1081/361 

gat etc gac ggc gag ate gcg gcg gtc 
asp leu asp gly glu ile ala ala val 
1141/381 

gcg gtg tac acc ace gag gaa ate ggt 
ala val tyr thr thr glu glu ile gly 
1201/401 

egg gee ctg ttg ctg gee ggc gag gaa 
arg ala leu leu leu ala gly glu glu 
1261/421 

gag acc atg teg egg cgt age cgt tec 
glu thr met ser arg arg ser arg ser 
1321/441 

ctg gag cgc aac gag gag gat ccc gee 
leu glu arg asn glu glu asp pro ala 
1381/461 

gee gee egg ctg cgc cgc aac age gee 
ala ala arg leu arg arg asn ser ala 
1441/481 

cgt gac cac cgc gag ccg gtg ccg ctg 
arg asp his arg glu pro val pro leu 
1501/501 

gag gac tat cgc cgc gtc gac ate gcg 
glu asp tyr arg arg val asp ile ala 
1561/521 

get ggt ggc gtc att cat ctg ctt gee 
ala. gly gly val ile his leu leu ala 
1621/541 

ccg acc aca ccc gtt egg gtt gee gee 
pro thr thr pro val arg val ala ala 
1681/561 

ate teg gat tec ggc ctg ggc atg acc 
ile ser asp ser gly leu gly met thr 
1741/581 

egg gee ggc ggt gag gtc acc ccg gat 
arg ala gly gly glu val thr pro asp 
1801/601 

egg ctg gee ggt egg cac ggc ate cga 
arg leu ala gly arg his gly ile arg 
1861/621 

ggc acc ggc acc acc gee gag gtc tac 
gly thr gly thr thr ala glu val tyr 
1921/641 

gcg cag ccg cca aag ccg egg gta ttt 
ala gin pro pro lys pro arg val phe 
1981/661 

gee gat ccg acg gac gtt ccc gec gee 
ala asp pro thr asp val pro ala ala 





lUbl/ iol 
















ctg 


cgt gat ggg 


gcg 


etc 


aag 


gtt 


get 


cat 


acc 


1 eu 


arg asp gly 


aia 


leu 


lys 


val 


ala 


nis 


4- Vi -v 




1111/ J71 
















cgc 


gee ggc gac 


gag 


ccg 


ate 


ccc 


gag 


cca 


ctg 


arg 


ala gly asp 


glu 


pro 


ile 


pro 


glu 


pro 


leu 




1171 / 391 
















cag 


gtc gcg cat 


gcg 


gtc 


gac 


gag 


ctg 


cac 


acc 


gin 


val ala his 


ala 


val 


asp 


glu 


leu 


his 


thr 




1231/411 
















acg 


egg ttg cga 


ctg 


ctg 


gtc 


aac 


gag 


atg 


ttt 


thr 


arg leu arg 


leu 


leu 


val 


asn 


glu 


met 


phe 




1291/431 
















ctg 


gtc gac cag 


cag 


ctg 


teg 


gtc 


ate 


gac 


caa 


leu 


val asp gin 


gin 


leu 


ser 


val 


ile 


asp 


gin 




13 51/451 
















cga 


etc gac age 


ctt 


ttc 


egg 


etc 


gat 


cac 


ctg 


arg 


leu asp ser 


leu 


phe 


arg 


leu 


asp 


his 


leu 




1411 / 47 1 
















aac 


ctg ctg gtg 


ctg 


gee 


ggt 


gcg 


cag 


att 


acc 


asn 


leu leu val 


leu 


ala 


gly 


ala 


gin 


ile 


thr 




14 / i/ 4yi 
















tea 


acc gtg ate 


age 


gee 


gee 


gtg 


tea 


gag 


gtc 


ser 


+— V"i ■>— -tra 1 A T a 
1— 1 1 X. V Cl X J Lc 


ser 


axa 


cl lei 


Va 1 


ser 


glu 


val 




1 _) J 1 / Jll 
















agg 


gta ccc gac 


tgt 


gcg 


gca 


gtc 


ggc 


gca 


gcg 


arg 


val pro asp 


cys 


aia 


val 


vai 


gly 


aia 


ala 




I'CQI /CI 1 

1 D 1 / jjI 
















gag 


ctg ate gac 


aac 


gcg 


ttg 


cgc 


tac 


teg 


tea 


glu 


leu ile asp 


asn 


ala 


leu 


arg 


tyr 


ser 


ser 




1 CC1 /CC1 

IOjI/jDI 
















gca 


ate ggc age 


gaa 


ggc 


age 


gtt 


ctg 


ctg 


cga 


ala 


ile gly ser 


glu 


gly 


ser 


val 


leu 


leu 


arg 




1 /ll/b / l 
















gat 


gee gat egg 


egg 


atg 


gec 


aat 


atg 


egg 


ctg 


asp 


ala asp arg 


arg 


me t 


aia 


asn 


met 


arg 


leu 




1771/591 
















agt 


gec egg cac 


atg 


ggt 


ctg 


ttc 


gta 


gtc 


ggc 


ser 


ala arg his 


met 


gly 


leu 


pne 


val 


val 


gly 




lo Jl/ oil 
















gtc 


ggg ctg cgc 


ggt 


ccg 


gtg 


acc 


ggt 


gaa 


cag 


val 


gly leu arg 


gly 


pro 


val 


thr 


gly 


glu 


gin 




1891/631 
















ctg 


ccg eta gec 


gtg 


etc 


gag 


ggg 


acg 


gee 


cca 


leu 


pro leu ala 


val 


leu 


glu 


gly 


thr 


ala 


pro 




1951/651 
















gcg 


ate aag ccg 


ccg 


tgt 


cct 


gaa 


ccc 


gcg 


gcg 


ala 


ile lys pro 


pro 


cys 


pro 


glu 


pro 


ala 


ala 




2011/671 
















ate 


ggg ccg eta 


cca 


ccg 


gtc 


acg 


ttg 


etc 


ccg 


ile 


gly pro leu 


pro 


pro 


val 


thr 


leu 


leu 


pro 



SEQ XD NOS: 485-486 (continued 1) 

FIG. 21 D (continued 1) 



2041/681 2071/691 



cgc cgt acc 


ccg 


ggg 


tec 


agt 


ggc 


ate 


gec 


gac gtc ccg 


gec 


cag 


ccg 


atg 


cag 


cag 


egg 


arg arg thr 


pro 


gly 


ser 


ser 


gly 


ile 


ala 


asp val pro 


ala 


gin 


pro 


met 


gin 


gin 


arg 


2101/701 
















2131/711 
















egg cgc gag 


ctg 


aaa 


aca 


ccc 


tgg 


tgg 


gag 


gat agg ttt 


caa 


cag 


gag 


ccc 


aaa 


caa 


ccg 


arg arg glu 


leu 


lys 


thr 


pro 


trp 


trp 


glu 


asp arg phe 


gin 


gin 


glu 


pro 


lys 


gin 


pro 


2161/721 
















2191/731 
















ccc gca cca 


gaa 


ccg 


cga 


ccg 


gcg 


ccg 


ccg 


ccc gee aaa 


ccc 


gcg 


cca 


ccg 


gcg 


ggc 


ccg 


pro ala pro 


glu 


pro 


arg 


pro 


ala 


pro 


pro 


pro ala lys 


pro 


ala 


pro 


pro 


ala 


gly 


pro 


2221/741 
















2251/751 
















gtt gat gac 


gac 


gtc 


ate 


tac 


egg 


egg 


atg 


etc tec gag 


atg 


gtg 


ggt 


gac 


ccg 


cac 


gag 


val asp asp 


asp 


val 


ile 


tyr 


arg 


arg 


met 


leu ser glu 


met 


val 


gly 


asp 


pro 


his 


glu 


2281/761 
















2311/771 
















ctg gec cac 


age 


ccc 


gat 


ctg 


gac 


tgg 


aag 


teg gtg tgg 


gac 


cac 


ggc 


tgg 


teg 


gcg 


gee 


leu ala his 


ser 


pro 


asp 


leu 


asp 


trp 


lys 


ser val trp 


asp 


his 


gly 


trp 


ser 


ala 


ala 


2.3 41 / Vol 
















2371/791 
















gec gag gec 


gcg 


gac 


aag 


ccc 


gtg 


cag 


tec 


cgc acg gac 


tac 


ggc 


ctg 


ccg 


gtg 


cgc 


gaa 


ala glu ala 


ala 


asp 


lys 


pro 


val 


gin 


ser 


arg thr asp 


tyr 


gly 


leu 


pro 


val 


arg 


glu 


2401/801 
















2431/811 
















ccc ggg gec 


egg 


tta 


gtg 


ccg 


ggg 


gcg 


gcg 


gtg cct gag 


gga 


ccc 


gat 


egg 


gag 


cat 


ccg 


pro gly ala 


arg 


leu 


val 


pro 


gly 


ala 


ala 


val pro glu 


gly 


pro 


asp 


arg 


glu 


his 


pro 


2461/821 
















2491/831 
















ggt gca gcg 


eta 


gca 


tec 


aac 


ggc 


gga 


ctt 


cat ccc ggc 


cga 


gcg 


ccg 


egg 


cac 


gcg 


get 


gly ala ala 


leu 


ala 


ser 


asn 


gly 


gly 


leu 


his pro gly 


arg 


ala 


pro 


arg 


his 


ala 


ala 


2521/841 
















2551/851 
















gcg gta cgc 


gac 


ccc 


gac 


gcg 


gtt 


cgt 


gec 


tec ate age 


age 


cat 


ttc 


ggc 


ggc 


gtg 


cgc 


ala val arg 


asp 


pro 


asp 


ala 


val 


arg 


ala 


ser ile ser 


ser 


his 


phe 


gly 


gly 


val 


arg 


2581/861 
















2611/871 
















acc ggg egg 


teg 


cat 


gee 


cgc 


gag 


age 


agt 


cag gga ccc 


aat 


cag 


caa 


tga 








thr gly arg 


ser 


his 


ala 


arg 


glu 


ser 


ser 


gin gly pro 


asn 


gin 


gin; 


)OPA 









SEQ ZD NOS: 485-486 (continued) 

FIG. 21 D ( c ont inued ) 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv3365c 



1/1^ SEQ ID NO: 487 










31/ 11 


















taa ggg 


tgc 


ggc 


egg 


tgg 


cac 


ggc 


cgc 


ggc 


cac gtg 


acc 


atg 


ttc 


gee 


cgc 


ccg 


acc 


ate 


OCH (gly 


cys 


gly 


arg 


trp 


his 


gly 


arg 


giy 


his val 


tnr 


met 


pne 


ala 


arg 


pro 


tnr 


ile 


61/2 1 — 


— SEQ ID 


NO : 


488 










91/31 


















ccg gtc 


gcg 


gcg 


gee 


get 


tct 


gat 


att 


tec 


gee ccg 


get 


caa 


ccg 


gee 


cgc 


ggc 


aaa 


cct 


pro val 


ala 


ala 


ala 


ala 


ser 


asp 


ile 


ser 


ala pro 


ala 


gin 


pro 


ala 


arg 


gly 


lys 


pro 


121/41 


















151/51 


















cag caa 


cgc 


ccg 


ccg 


tec 


tgg 


teg 


ccg 


cgc 


aac tgg 


ccg 


gtc 


cga 


tgg 


aaa 


gtg 


ttc 


acg 


gin gin 


arg 


pro 


pro 


ser 


trp 


ser 


pro 


arg 


asn trp 


pro 


val 


arg 


trp 


lys 


val 


phe 


thr 


181 / 61 


















211/71 


















ate gcg 


ct t 


ctg 


ccg 


ctg 


gta 


gtg 


gcg 


atg 


gtg tta 


gca 


gga 


ttg 


egg 


gtc 


gag 


get 


gcg 


ile ala 


leu 


leu 


pro 


leu 


val 


vai 


ala 


met 


va 1 1 eu 


ala 


giy 


leu 


arg 


val 


glu 


ala 


ala 


241 / 81 


















271/91 


















atg gec 


age 


acc 


age 


ggc 


ctg 


egg 


ctg 


gtc 


gee gcg 


cgc 


gee 


gaa 


atg 


ata 


ccc 


gcg 


ate 


met ala 


ser 


thr 


ser 


gly 


leu 


arg 


leu 


vai 


ala ala 


arg 


ala 


glu 


met 


ile 


pro 


ala 


ile 


jUI/ 1U1 


















"3 11 /111 
J j1 / 111 


















acg aaa 


tac 


atg 


teg 


gcg 


ctg 


gac 


gtc 


gee 


gtg ctg 


gee 


age 


teg 


acc 


gga 


cac 


gat 


gtg 


thr lys 


tyr 


met 


ser 


ala 


leu 


asp 


vai 


did 


val leu 


aia 


ser 


ser 


tnr 


gly 


nis 


asp 


val 


jbl/lzl 


















391/131 


















gag ggg 


gcg 


cag 


aaa 


aac 


t tc 


acc 


gec 


cgc 


aag tac 


gag 


ctg 


cag 


acg 


cga 


ctg 


gec 


gac 


glu gly 


ala 


gin 


lys 


asn 


phe 


tnr 


aia 


arg 


lys tyr 


glu 


leu 


gin 


tnr 


arg 


leu 


ala 


asp 


A O 1 /I yl 1 

4zl/141 


















4bl/ 151 


















acc gac 


gtc 


ate 


gca 


gac 


gtg 


egg 


teg 


gga 


gtg aac 


acg 


ctg 


etc 


aac 


ggc 


ggt 


cag 


gcg 


thr asp 


val 


ile 


ala 


asp 


vai 


arg 


ser 


gly 


val asn 


tnr 


leu 


leu 


asn 


gly 


giy 


gin 


ala 


481/161 


















bll/ 171 


















ctg ctg 


gat 


aag 


gtg 


ctg 


gee 


gac 


age 


ate 


ggc ttg 


egg 


gat 


egg 


gtc 


acc 


gee 


tac 


gcg 


leu leu 


asp 


lys 


val 


leu 


ala 


asp 


ser 


ile 


gly leu 


arg 


asp 


arg 


val 


4- V* -w. 

tnr 


ala 


tyr 


ala 


541/181 


















571 / 191 


















ccg ctg 


ctg 


ttg 


acg 


gee 


cag 


aac 


gtg 


att 


gac gcg 


teg 


gtg 


egg 


gtt 


gac 


age 


gag 


caa 


pro leu 


leu 


leu 


thr 


ala 


gin 


asn 


val 


ile 


asp ala 


ser 


val 


arg 


val 


asp 


ser 


glu 


gin 


601/201 


















631 /2 11 


















ate cga 


acc 


cag 


gtg 


cag 


ggt 


ttg 


age 


cga 


gee gtt 


ggc 


gee 


cgc 


ggg 


cag 


atg 


acg 


atg 


ile arg 


thr 


gin 


val 


gin 


gly 


leu 


ser 


arg 


ala val 


gly 


aia 


arg 


giy 


gin 


met 


i-Viv 

tnr 


met 


661/221 


















o y i / z j1 


















cag gag 


ate 


ctg 


gtg 


act 


cgc 


ggc 


gee 


gac 


ctt gee 


gag 


ccg 


caa 


ctg 


cgc 


age 


gcg 


atg 


gin glu 


ile 


leu 


val 


thr 


arg 


gly 


ala 


asp 


leu ala 


glu 


pro 


gin 


leu 


arg 


ser 


ala 


met 


721/241 


















751/251 


















gtt acc 


ctg 


gec 


ggc 


acc 


gaa 


ccc 


teg 


acg 


ctg ttc 


ggg 


atg 


age 


gcg 


gcg 


etc 


ggt 


gca 


val thr 


leu 


ala 


gly 


thr 


glu 


pro 


ser 


thr 


leu phe 


gly 


met 


ser 


ala 


ala 


leu 


gly 


ala 


781/261 


















811/271 


















ggc teg 


ccg 


gac 


acc 


aag 


aac 


ctg 


cag 


cag 


caa atg 


gtg 


acc 


agg 


atg 


gcg 


ate 


atg 


tec 


gly ser 


pro 


asp 


thr 


lys 


asn 


leu 


gin 


gin 


gin met 


val 


thr 


arg 


met 


ala 


ile 


met 


ser 


841/281 


















871/291 


















gat ccg 


gec 


gtt 


gca 


ctg 


gtc 


aac 


aac 


cca 


gag ctg 


ctg 


cac 


teg 


ata 


cag 


ate 


acc 


cgc 


asp pro 


ala 


val 


ala 


leu 


val 


asn 


asn 


pro 


glu leu 


leu 


his 


ser 


ile 


gin 


ile 


thr 


arg 



SEQ ZD NOS: 487-488 

FIG. 21 E 



2 S 200 



901/301 931/311 

gac att gcc gag cag gtg ate acc gac acc acc gag gcg gtg acg aag teg gtg caa age 

asp ile ala glu gin val ile thr asp thr thr glu ala val thr lys ser val gin ser 
961/321 991/331 

cag gcc acc gac egg egg gat gcc gcg att cgc gac gcc gtg ctg gtg ttg gcc gcc ate 

gin ala thr asp arg arg asp ala ala ile arg asp ala val leu val leu ala ala ile 
1021/341 1051/351 

gcg acc gcg ate gtc gtc gtg ttg gtg gtg gcg cgc acg ctg gtc ggg ccg atg egg gta 

ala thr ala ile val val val leu val val ala arg thr leu val gly pro met arg val 
1081/361 1111/371 

ctg cgt gat ggg gcg etc aag gtt get cat acc gat etc gac ggc gag ate gcg gcg gtc 

leu arg asp gly ala leu lys val ala his thr asp leu asp gly glu ile ala ala val 
1141/381 1171/391 

cgc gcc ggc gac gag ccg ate ccc gag cca ctg gcg gtg tac acc acc gag gaa ate ggt 

arg ala gly asp glu pro ile pro glu pro leu ala val tyr thr thr glu glu ile gly 
1201/401 1231/411 

cag gtc gcg cat gcg gtc gac gag ctg cac acc egg gcc ctg ttg ctg gcc ggc gag gaa 

gin val ala his ala val asp glu leu his thr arg ala leu leu leu ala gly glu glu 
1261/421 1291/431 

acg egg ttg cga ctg ctg gtc aac gag atg ttt gag acc atg teg egg cgt age cgt tec 

thr arg leu arg leu leu val asn glu met phe glu thr met ser arg arg ser arg ser 
1321/441 1351/451 

ctg gtc gac cag cag ctg teg gtc ate gac caa ctg gag cgc aac gag gag gat ccc gcc 

leu val asp gin gin leu ser val ile asp gin leu glu arg asn glu glu asp pro ala 
1381/461 1411/471 

cga etc gac age ctt ttc egg etc gat cac ctg gcc gcc egg ctg cgc cgc aac age gcc 

arg leu asp ser leu phe arg leu asp his leu ala ala arg leu arg arg asn ser ala 
1441/481 1471/491 

aac ctg ctg gtg ctg gcc ggt gcg cag att acc cgt gac cac cgc gag ccg gtg ccg ctg 

asn leu leu val leu ala gly ala gin ile thr arg asp his arg glu pro val pro leu 
1501/501 1531/511 

tea acc gtg ate age gcc gcc gtg tea gag gtc gag gac tat cgc cgc gtc gac ate gcg 

ser thr val ile ser ala ala val ser glu val glu asp tyr arg arg val asp ile ala 
1561/521 1591/531 

agg gta ccc gac tgt gcg gta gtc ggc gca gcg get ggt ggc gtc att cat ctg ctt gcc 

arg val pro asp cys ala val val gly ala ala ala gly gly val ile his leu leu ala 
1621/541 1651/551 

gag ctg ate gac aac gcg ttg cgc tac teg tea ccg acc aca ccc gtt egg gtt gcc gcc 

glu leu ile asp asn ala leu arg tyr ser ser pro thr thr pro val arg val ala ala 
1681/561 1711/571 

gca ate ggc age gaa ggc agt gtt ctg ctg cga ate teg gat tec ggc ctg ggc atg acc 

ala ile gly ser glu gly ser val leu leu arg ile ser asp ser gly leu gly met thr 
1741/581 1771/591 

gat gcc gat egg egg atg gcc aat atg egg ctg egg gcc ggc ggt gag gtc acc ccg gat 

asp ala asp arg arg met ala asn met arg leu arg ala gly gly glu val thr pro asp 
1801/601 1831/611 

agt gcc egg cac atg ggt ctg ttc gta gtc ggc egg ctg gcc ggt egg cac ggc ate cga 

ser ala arg his met gly leu phe val val gly arg leu ala gly arg his gly ile arg 



SEQ ZD NOS: 487-488 (continued 1) 

FIG. 21 E (continued 1) 



1861/621 

gtc ggg ctg cgc ggt ccg gtg acc ggt gaa 
val gly leu arg gly pro val thr gly glu 
1921/641 

ctg ccg eta gec gtg etc gag ggg acg gec 
leu pro leu ala val leu glu gly thr ala 
1981/661 

gcg ate aag ccg ccg tgt cct gaa ccc gcg 
ala ile lys pro pro cys pro glu pro ala 
2041/681 

ate ggg ccg eta cca ccg gtc acg ttg etc 
ile gly pro leu pro pro val thr leu leu 
2101/701 

gec gac gtc ccg gee cag ccg atg cag cag 
ala asp val pro ala gin pro met gin gin 
2161/721 

gag gat agg ttt caa cag gag ccc aaa caa 
glu asp arg phe gin gin glu pro lys gin 
2221/741 

ccg ccc gee aaa ccc gcg cca ccg gcg ggc 
pro pro ala lys pro ala pro pro ala gly 
2281/761 

atg etc tec gag atg gtg ggt gac ccg cac 
met leu ser glu met val gly asp pro his 
2341/781 

aag teg gtg tgg gac cac ggc tgg teg gcg 
lys ser val trp asp his gly trp ser ala 
2401/801 

tec cgc acg gac tac ggc ctg ccg gtg cgc 
ser arg thr asp tyr gly leu pro val arg 
2461/821 

gcg gtg cct gag gga ccc gat egg gag cat 
ala val pro glu gly pro asp arg glu his 
2521/841 

ctt cat ccc ggc cga gcg ccg egg cac gcg 
leu his pro gly arg ala pro arg his ala 
2581/861 

gee tec ate age age cat ttc ggc ggc gtg 
ala ser ile ser ser his phe gly gly val 
2641/881 

agt cag gga ccc aat cag caa tga 
ser gin gly pro asn gin gln)OPA 



1891/631 

cag ggc acc ggc acc acc gec gag gtc tac 
gin gly thr gly thr thr ala glu val tyr 
1951/651 

cca gcg cag ccg cca aag ccg egg gta ttt 
pro ala gin pro pro lys pro arg val phe 
2011/671 

gcg gee gat ccg acg gac gtt ccc gee gec 
ala ala asp pro thr asp val pro ala ala 
2071/691 

ccg cgc cgt acc ccg ggg tec agt ggc ate 
pro arg arg thr pro gly ser ser gly ile 
2131/711 

egg egg cgc gag ctg aaa aca ccc tgg tgg 
arg arg arg glu leu lys thr pro trp trp 
2191/731 

ccg ccc gca cca gaa ccg cga ccg gcg ccg 
pro pro ala pro glu pro arg pro ala pro 
2251/751 

ccg gtt gat gac gac gtc ate tac egg egg 
pro val asp asp asp val ile tyr arg arg 
2311/771 

gag ctg gee cac age ccc gat ctg gac tgg 
glu leu ala his ser pro asp leu asp trp 
2371/791 

gec gee gag gee gcg gac aag ccc gtg cag 
ala ala glu ala ala asp lys pro val gin 
2431/811 

gaa ccc ggg gec egg tta gtg ccg ggg gcg 
glu pro gly ala arg leu val pro gly ala 
2491/831 

ccg ggt gca gcg eta gca tec aac ggc gga 
pro gly ala ala leu ala ser asn gly gly 
2551/851 

get gcg gta cgc gac ccc gac gcg gtt cgt 
ala ala val arg asp pro asp ala val arg 
2611/871 

cgc acc ggg egg teg cat gee cgc gag age 
arg thr gly arg ser his ala arg glu ser 



SEQ XD NOS: 487-488 (continued 2) 

FIG. 21 E (continued 2) 



'SIP- 



^SEQ ID NO: 489 31/11 

CTA CGA CAA GGC AAA GGA GCA CAG GGT GAA GCG TGG ACT GAC GGT CGC GGT AGC CGG AGC 
( leu arg gin gly lys gly ala gin gly glu ala trp thr asp gly arg gly ser arg ser 
61/21 ■ — SEQ ID NO: 490 91/31 

CGC CAT TCT GGT CGC AGG TCT TTC CGG ATG TTC AAG CAA CAA GTC GAC TAC AGG AAG CGG 
arg his ser gly arg arg ser phe arg met phe lys gin gin val asp tyr arg lys arg) 
121/41 151/51 

TGA GAC CAC GAC CGC GGC AGG CAC GAC GGC AAG CCC CGG CGC CGC ATC CGG GCC GAA GGT 
OPA(a sp h is asp arg gly arg his asp gly lys pro arg arg arg ile arg ala glu gly 
181/61 ^SEQ ID NO: 491 211/71 

CGT CAT CGA CGG TAA GGA CCA GAA CGT CAC CGG GTC TGT GGT GTG CAC AAC CGC GGC CGG 
arg his arg arg)OCH(gly pro glu arg his arg val cys gly val his asn arg gly arg 
241/81 ^ — SEQ ID NO: 492 271/91 

CAA TGT CAA CAT CGC GAT CGG CGG GGC GGC GAC CGG CAT TGC CGC CGT GCT CAC CGA CGG 
gin cys gin his arg asp arg arg gly gly asp arg his cys arg arg ala his arg arg 
301/101 331/111 

CAA CCC TCC GGA GGT GAA GTC CGT TGG GCT CGG TAA CGT CAA CGG CGT CAC GCT GGG ATA 
gin pro ser gly gly glu val arg trp ala arg)OCH(arg gin arg arg his ala gly ile 
361/121 391/131*SEQ ID NO: 493 

CAC GTC GGG CAC CGG ACA GGG TAA CGC TCG GCA ACC AAG GAC GGC AGC CAC TAC AAG ATC 
his val gly his arg thr gly)OCH(arg ser ala thr lys asp gly ser his tyr lys ile) 

^ — SEQ ID NO: 494 

SEQ ID NOS: 489-494 

FIG. 22A 



^ SEQ 


ID NO: 495 












32/11 


















TAC GAC 


AAG 


GCA 


AAG 


GAG 


CAC 


AGG 


GTG 


AAG 


CGT GGA 


CTG 


ACG 


GTC 


GCG 


GTA 


GCC 


GGA 


GCC 


(tyr asp 


lys 


ala 


lys 


glu 


his 


arg 


val 


lys 


arg gly 


leu 


thr 


val 


ala 


val 


ala 


gly 


ala 


62/21^-^ 


SEQ 


ID NO: 496 










92/31 


















GCC ATT 


CTG 


GTC 


GCA 


GGT 


CTT 


TCC 


GGA 


TGT 


TCA AGC 


AAC 


AAG 


TCG 


ACT 


ACA 


GGA 


AGC 


GGT 


ala ile 


leu 


val 


ala 


gly 


leu 


ser 


gly 


cys 


ser ser 


asn 


lys 


ser 


thr 


thr 


gly 


ser 


gly 


122/41 


















152/51 


















GAG ACC 


ACG 


ACC 


GCG 


GCA 


GGC 


ACG 


ACG 


GCA 


AGC CCC 


GGC 


GCC 


GCA 


TCC 


GGG 


CCG 


AAG 


GTC 


glu thr 


thr 


thr 


ala 


ala 


gly 


thr 


thr 


ala 


ser pro 


gly 


ala 


ala 


ser 


gly 


pro 


lys 


val 


182/61 


















212/71 


















GTC ATC 


GAC 


GGT 


AAG 


GAC 


CAG 


AAC 


GTC 


ACC 


GGG TCT 


GTG 


GTG 


TGC 


ACA 


ACC 


GCG 


GCC 


GGC 


val ile 


asp 


gly 


lys 


asp 


gin 


asn 


val 


thr 


gly ser 


val 


val 


cys 


thr 


thr 


ala 


ala 


gly 


242/81 


















272/91 


















AAT GTC 


AAC 


ATC 


GCG 


ATC 


GGC 


GGG 


GCG 


GCG 


ACC GGC 


ATT 


GCC 


GCC 


GTG 


CTC 


ACC 


GAC 


GGC 


asn val 


asn 


ile 


ala 


ile 


gly 


gly 


ala 


ala 


thr gly 


ile 


ala 


ala 


val 


leu 


thr 


asp 


gly 


302/101 


















332/111 


















AAC CCT 


CCG 


GAG 


GTG 


AAG 


TCC 


GTT 


GGG 


CTC 


GGT AAC 


GTC 


AAC 


GGC 


GTC 


ACG 


CTG 


GGA 


TAC 


asn pro 


pro 


glu 


val 


lys 


ser 


val 


gly 


leu 


gly asn 


val 


asn 


gly 


val 


thr 


leu 


gly 


tyr 


362/121 


















392/131 


















ACG TCG 


GGC 


ACC 


GGA 


CAG 


GGT 


AAC 


GCT 


CGG 


CAA CCA 


AGG 


ACG 


GCA 


GCC 


ACT 


ACA 


AGA 


TC 


thr ser 


gly 


thr 


gly 


gin 


gly 


asn 


ala 


arg 


gin pro 


arg 


thr 


ala 


ala 


thr 


thr 


arg) 



SEQ ZD NOS: 495-496 



FIG. 22B 



SEQ ID NO: 497 




























ACG ACA 


AGG 


CAA 


AGG 


AGC 


APA 


GGG 


TGA 






TGA 


CGG 




CCjG 


TAG 


CCG 


GAG 


CCG 


(thr thr 


arg 


gin 


arg 


ser 


thr 


gly)OPA 


ser 


val asp 


OPA 


arg 


ser 


arg 


AMB (pro 


g lu 


pro 


63/21^ — 


-SEQ 


ZD NO: 498 










93 / 31 


SEQ ID 














CCA TTC 


TGG 


TCG 


CAG 


GTC 


TTT 




tv rp 

KjJ\ 1 












CTA 


CAG 


GAA 


GCG 


GTG 


nro oil© 




ser 


gin 


val 


phe 


pro 


asp 


va 1 




tnr 


ser 


arg 


leu 


gin 


glu 


ala 


Va X 


12 3/41 


















X -J J / J X 














AGA CCA 


CGA 


CCG 


CGG 


CAG 


GCA 


CGA 


CGG 










LAI 




GGC 


CGA 


AGG 


TCG 


arg pro 


arg 


pro 


arg 


gin 


ala 


arg 


arg 


gin 


ala pro 


ala 




his 


pro 


gly 


arg 


arg 


ser 


183/61 


















J / /X 


















TCA TCG 


ACG 


GTA 


AGG 


ACC 


AGA 


ACG 


TCA 


CCG 


Vjvj J. ± 


TGG 


TGT 




CAA 


CCG 


CGG 


CCG 


GCA 


ser ser 


thr 


val 


arg 


thr 


arg 


thr 


ser 


pro 




trp 


cys 


ala 


gin 


pro 


arg 


pro 


axa 


243/81 


















273/91 


















ATG TCA 


ACA 


TCG 


CGA 


TCG 


GCG 


GGG 


CGG 


CGA 


CCG GCA 


TTG 


CCG 


CCG 


TGC 


TCA 


CCG 


ACG 


GCA 


met ser 


thr 


ser 


arg 


ser 


ala 


giy 


arg 


arg 


pro ala 


leu 


pro 


pro 


cys 


ser 


pro 


thr 


ala 


303/101 


















333/111 


















ACC CTC 


CGG 


AGG 


TGA 


AGT 


CCG 


TTG 


GGC 


TCG 


GTA ACG 


TCA 


ACG 


GCG 


TCA 


CGC 


TGG 


GAT 


ACA 


thr leu 


arg 


arg)OPA(ser 


pro 


leu 


gly 


ser 


val thr 


ser 


thr 


ala 


ser 


arg 


trp 


asp 


thr 


363/121 








^-SEQ ID NO: 


500 


393/131 


















CGT CGG 


GCA 


CCG 


GAC 


AGG 


GTA 


ACG 


CTC 


GGC 


AAC CAA 


GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 


arg arg 


ala 


pro 


asp 


arg 


val 


thr 


leu 


gly 


asn gin 


gly 


arg 


gin 


pro 


leu 


gin 


asp) 





SEQ ID NOS: 497-500 

FIG. 22C 



SEQ ID NO: 501 31/11 



GCA CAA 


CCG 


CGG 


CCG 


GCA 


ATG 


TCA 


ACA 


TCG 


CGA TCG 


GCG 


GGG 


CGG 


CGA 


CCG 


GCA 


TTG 


CCG 


(ala gin 


pro 


arg 


pro 


ala 


met 


ser 


thr 


ser 


arg ser 


ala 


gly 


arg 


arg 


pro 


ala 


leu 


pro 


61/2 1'^- 


— SEQ ID 


NO: 


502 










91/31 


















CCG TGC 


TCA 


CCG 


ACG 


GCA 


ACC 


CTC 


CGG 


AGG 


TGA AGT 


CCG 


TTG 


GGC 


TCG 


GTA 


ACG 


TCA 


ACG 


pro cys 


ser 


pro 


thr 


ala 


thr 


leu 


arg 


arg)OPA(ser 


pro 


leu 


gly 


ser 


val 


thr 


ser 


thr 


121/41 


















151/51"^ 


SEQ ID NO: 


503 








GCG TCA 


CGC 


TGG 


GAT 


ACA 


CGT 


CGG 


GCA 


CCG 


GAC AGG 


GTA 


ACG 


CCT 


CGG 


CAA 


CCA 


AGG 


ACG 


ala ser 


arg 


trp 


asp 


thr 


arg 


arg 


ala 


pro 


asp arg 


val 


thr 


pro 


arg 


gin 


pro 


arg 


thr 


181/61 


















211/71 


















GCA GCC 


ACT 


ACA 


AGA 


TCA 


CAG 


GGT 


GAA 


GCG 


TGG ACT 


GAC 


GGT 


CGC 


GGT 


AGC 


CGG 


AGC 


CGC 


ala ala 


thr 


thr 


arg 


ser 


gin 


giy 


glu 


ala 


trp thr 


asp 


giy 


arg 


giy 


ser 


arg 


ser 


arg 


241/81 


















271/91 


















CAT TCT 


GGT 


CGC 


AGG 


TCT 


TTC 


CGG 


ATG 


TTC 


AAG CAA 


CAA 


GTC 


GAC 


TAC 


AGG 


AAG 


CGG 


TGA 


his ser 


gly 


arg 


arg 


ser 


phe 


arg 


met 


phe 


lys gin 


gin 


val 


asp 


tyr 


arg 


lys 


arg) OPA 


301/101 


















331/111 


















GAC CAC 


GAC 


CGC 


GGC 


AGG 


CAC 


GAC 


GGC 


AAG 


CCC CGG 


CGC 


CGC 


TCC 


GGG 


CCG 


AAG 


GTC 


GTC 


(asp his 


asp 


arg 


gly 


arg 


his 


asp 


gly 


lys 


pro arg 


arg 


arg 


ser 


giy 


pro 


lys 


val 


val 


361/121 


^SEQ ID NO: 


504 








391/131 


















ATC GAC 


GGT 


AAG 


GAC 


CAG 


AAC 


GTC 


ACC 


GGC 


TCC GTG 


GTG 


TGC 


ACA 


ACC 


GCG 


GCC 


GGC 


AAT 


ile asp 


gly 


lys 


asp 


gin 


asn 


val 


thr 


gly 


ser val 


val 


cys 


thr 


thr 


ala 


ala 


giy 


asn 


421/141 


















451/151 


















GTC AAC 


ATC 


GCG 


ATC 


GGC 


GGG 


GCG 


GCG 


ACC 


GGC ATT 


GCC 


GCC 


GTG 


CTC 


ACC 


GAC 


GGC 


AAC 


val asn 


ile 


ala 


ile 


gly 


gly 


ala 


ala 


thr 


gly ile 


ala 


ala 


val 


leu 


thr 


asp 


gly 


asn 


481/161 


















511/171 


















CCT CCG 


GAG 


GTG 


AAG 


TCC 


GTT 


GGG 


CTC 


GGT 


AAC GTC 


AAC 


GGC 


GTC 


ACG 


CTG 


GGA 


TAC 


ACG 


pro pro 


glu 


val 


lys 


ser 


val 


gly 


leu 


gly 


asn val 


asn 


giy 


val 


thr 


leu 


giy 


tyr 


thr 


541/181 


















571/191 


















TCG GGC 


ACC 


GGA 


CAG 


GGT 


AAC 


GCC 


TCG 


GCA 


ACC AAG 


GAC 


GGC 


AGC 


CAC 


TAC 


AAG 


ATC 




ser gly 


thr 


gly 


gin 


gly 


asn 


ala 


ser 


ala 


thr lys 


asp 


giy 


ser 


his 


tyr 


lys 


ile) 





SEQ XD NOS: 501-504 



FIG. 23A 





' SEQ 


ZD NO: 505 










32/11 


















PAP AAP 


CGC 


GGC 


CGG 


CAA 




pa a 


PAT 




GAT CGG 


CGG 


GGC 


GGC 


GAC 


CGG 


CAT 






(his asn 


arg 


gly 


arg 


gin 


cys 


gin 


hi s 


». J. y 


asp arg 


arg 


gly 


gly 


asp 


arg 


his 


cys 


arg 


62/21 — 


"SEQ 


JD NO: 506 










92/31 


















CGT GCT 


CAC 


CGA 


CGG 


CAA 


ppp 


T'PP 




p,prn 


GAA GTC 


CGT 


TGG 


GCT 


CGG 


TAA 


CGT 


LAA 




arg 


his 


arg 


arg 


gin 


pro 


ser 


(~r 1 ■»/■ 

y j-y 


gly 


glu val 


arg 


trp 


ala 


arg)OCH(arg 


g In 


arg 


122/41 


















152/51 


SEQ XD 


NO: 


507- 










CGT CAC 


GCT 


GGG 


ATA 


CAC 


GTC 


GGG 


CAC 


CGG 


ACA GGG 


TAA 


CGC 


CTC 


GGC 


AAC 


CAA 






arcr his 


ala 


crlv 
y - 1 - y 


ile 


his 


val 




hi s 


ci x. y 


thr gly)OCH(arg 


leu 


gly 


asn 


gin 


gly 


arg 


182/61 


















212 /71 




^■SEQ ID NO: 


508 






PAG PPA 


CTA 


CAA 


GAT 


CAC 








p/im 
\j J. 


GGA CTG 


ACG 


GTC 


GCG 


GTA 


GCC 


GGA 








J- cu 


gin 


asp 


his 


arg 


val 


lys 


arg 


gly leu 


thr 


val 


ala 


val 


ala 


gly 


a J_a 


axa 


242/81 


















272/91 






















GCA 




prprp 
v_ i. X 










AGC AAC 


AAG 


TCG 


ACT 


ACA 


GGA 


AGC 


GGT 


GAG 


1 1 e leU 


vai 


ala 


CT~\ "V 


1 eu 


ser 


9"iy 


cys 


ser 


ser asn 


lys 


ser 


thr 


thr 


gly 


ser 


gly 


glu 


\J f -L _L 


















332/111 


















app app 


ACC 


GCG 


GCA 


GGC 


ACG 


ACG 


GCA 




CCC GGC 


GCC 


GCT 


CCG 


GGC 


CGA 


AGG 


TCG 


TCA 


t-Vir t-VtT- 

L.I1J- L.X1X 


thr 


ala 


ala 


gly 


thr 


thr 


ala 


ser 


pro gly 


ala 


ala 


pro 


gly 


arg 


arg 


ser 


ser 


™J \J / J- JL 


















392/131 


















TCG ACG 


GTA 


AGG 


ACC 


AGA 


ACG 


TCA 


CCG 


GCT 


CCG TGG 


TGT 


GCA 


CAA 


CCG 


CGG 


CCG 


GCA 


ATG 


ser thr 


val 


arg 


thr 


arg 


thr 


ser 


pro 


ala 


pro trp 


cys 


ala 


gin 


pro 


arg 


pro 


ala 


met 


422/141 


















452/151 


















TCA AC A 


TCG 


CGA 


TCG 


GCG 


GGG 


CGG 


CGA 


CCG 


GCA TTG 


CCG 


CCG 


TGC 


TCA 


CCG 


ACG 


GCA 


ACC 


ser thr 


ser 


arg 


ser 


ala 


gly 


arg 


arg 


pro 


ala leu 


pro 


pro 


cys 


ser 


pro 


thr 


ala 


thr 


482/161 


















512/171 


















CTC CGG 


AGG 


TGA 


AGT 


CCG 


TTG 


GGC 


TCG 


GTA 


ACG TCA 


ACG 


GCG 


TCA 


CGC 


TGG 


GAT 


ACA 


CGT 


leu arg 


arg)OPA 


(ser 


pro 


leu 


gly 


ser 


val 


thr ser 


thr 


ala 


ser 


arg 


trp 


asp 


thr 


arg 


542/181 








-SEQ 


ID NO: 509 




572/191 


















CGG GCA 


CCG 


GAC 


AGG 


GTA 


ACG 


CCT 


CGG 


CAA 


CCA AGG 


ACG 


GCA 


GCC 


ACT 


ACA 


AGA 


TC 




arg ala 


pro 


asp 


arg 


val 


thr 


pro 


arg 


gin 


pro arg 


thr 


ala 


ala 


thr 


thr 


arg) 





SEQ ID NOS: 505-509 

FIG. 23B 



MAR 2 6 2003 w' 



SEQ ID NO: 510 33/11 



ACA ACC 


GCG 


GCC 


GGC 


AAT 


GTC 


AAC 


ATC 


GCG 


ATC GGC 


GGG 


GCG 


GCG 


ACC 


GGC 


ATT 


GCC 


GCC 


(thr thr 


ala 


ala 


gly 


asn 


val 


asn 


ile 


ala 


ile gly 


gly 


ala 


ala 


thr 


gly 


ile 


ala 


ala 


63/21 ^ 


SEQ 


ZD NO: 511 










93/31 


















GTG CTC 


ACC 


GAC 


GGC 


AAC 


CCT 


CCG 


GAG 


GTG 


AAG TCC 


GTT 


GGG 


CTC 


GGT 


AAC 


GTC 


AAC 


GGC 


val leu 


thr 


asp 


gly 


asn 


pro 


pro 


glu 


val 


lys ser 


val 


gly 


leu 


gly 


asn 


val 


asn 


gly 


123/41 


















153/51 


















GTC ACG 


CTG 


GGA 


TAC 


ACG 


TCG 


GGC 


ACC 


GGA 


CAG GGT 


AAC 


GCC 


TCG 


GCA 


ACC 


AAG 


GAC 


GGC 


val thr 


leu 


gly 


tyr 


thr 


ser 


gly 


thr 


gly 


gin gly 


asn 


ala 


ser 


ala 


thr 


lys 


asp 


gly 


183/61 


















213/71 


















AGC CAC 


TAC 


AAG 


ATC 


ACA 


GGG 


TGA 


AGC 


GTG 


GAC TGA 


CGG 


TCG 


CGG 


TAG 


CCG 


CAG 


CCG 


CCA 


ser his 


tyr 


lys 


ile 


thr 


gly)OPA 


ser 


val 


asp OPA 


arg 


ser 


arg 


AMB (pro 


glu 


pro 


pro 


243/81 


















273/91 


SEQ ID 


NO: 


512> 








TTC TGG 


TCG 


CAG 


GTC 


TTT 


CCG 


GAT 


GTT 


CAA 


GCA ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 


AGA 


phe trp 


ser 


gin 


val 


phe 


pro 


asp 


val 


gin 


ala thr 


ser 


arg 


leu 


gin 


glu 


ala 


val 


arg 


303/101 


















333/111 


















CCA CGA 


CCG 


CGG 


CAG 


GCA 


CGA 


CGG 


CAA 


GCC 


CCG GCG 


CCG 


CTC 


CGG 


GCC 


GAA 


GGT 


CGT 


CAT 


pro arg 


pro 


arg 


gin 


ala 


arg 


arg 


gin 


ala 


pro ala 


pro 


leu 


arg 


ala 


glu 


gly 


arg 


his 


363/121 


















393/131 


















CGA CGG 


TAA 


GGA 


CCA 


GAA 


CGT 


CAC 


CGG 


CTC 


CGT GGT 


GTG 


CAC 


AAC 


CGC 


GGC 


CGG 


CAA 


TGT 


arg arg)OCH(gly 


pro 


glu 


arg 


his 


arg 


leu 


arg gly 


val 


his 


asn 


arg 


gly 


arg 


gin 


cys 


423/141 




^ SEQ ID NO: 513 






453/151 


















CAA CAT 


CGC 


GAT 


CGG 


CGG 


GGC 


GGC 


GAC 


CGG 


CAT TGC 


CGC 


CGT 


GCT 


CAC 


CGA 


CGG 


CAA 


CCC 


gin his 


arg 


asp 


arg 


arg 


gly 


gly 


asp 


arg 


his cys 


arg 


arg 


ala 


his 


arg 


arg 


gin 


pro 


483/161 


















513/171 


















TCC GGA 


GGT 


GAA 


GTC 


CGT 


TGG 


GCT 


CGG 


TAA 


CGT CAA 


CGG 


CGT 


CAC 


GCT 


GGG 


ATA 


CAC 


GTC 


ser gly 


gly 


glu 


val 


arg 


trp 


ala 


arg)OCH(arg gin 


arg 


arg 


his 


ala 


gly 


ile 


his 


val 


543/181 








SEQ 


ID NO: 514 — 




573/191 


















GGG CAC 


CGG 


ACA 


GGG 


TAA 


CGC 


CTC 


GGC 


AAC 


CAA GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 




gly his 


arg 


thr 


gly)OCH(arg 


leu 


gly 


asn 


gin gly 


arg 


gin 


pro 


leu 


gin 


asp) 





SEQ ID NO: 515 



SEQ ID NOS: 510-515 

FIG. 23C 




31/11 



ota apc; 


ACA 


GGC 


AAA 


GGA 










(jLQj I CjCj 


ACT 


GAC 


GGT 


CGC 


GGT 


AGC 


CGG 


AGC 


\ J_ \3 LA U lli. 


thr 


gly 


lys 


gly 


ala 

aid 


gin 


gly 


glu. 


ala trp 


4- V» 

tnr 


asp 


giy 


arg 


gly 


ser 


arg 


ser 


61/217"""""- 


SEQ 


XD NO: 517 










31/ J X 


















CGC CAT 


TCT 


GGT 


err* 

LVjL 




J. V_ J. 


J- x v_ 








y"i A A 


**** A A 

LAA 


GTC 


GAC 


TAC 


AGG 


AAG 


CGG 


arg hlS 


ser 


gly 


arg 


arg 


ser 


phe 


arg 


met 


phe lys 


gin 


gin 


val 


asp 


tyr 


arg 


lys 


arg) 


121/41 


















1 CI /CI 

1j1 /OX 


















J. vjn unv. 


CAC 


GAC 


CGC 


GGC 




LAI 






AAG CCC 


CGG 


CGC 


CGC 


TCC 


GGG 


CCG 


AAG 


GTC 




his 


asp 


arg 


gly 


arg 


his 


asp 


giy 


lys pro 


arg 


arg 


arg 


ser 


gly 


pro 


lys 


val 


181 / 61^*" 


— SEQ ID 


NO: 


518 










4 XX / / I 


















GTC ATC 


GAC 


GGT 


AAG 


GAC 


CAG 


AAC 


GTC 


Apr* 








1 


A C* A 

ALA 


ALL 


LsLVj 


/—i /-I z'-i 


r-* 


val ile 


asp 


gly 


lys 


asp 


gin 


asn 


val 


thr 


gly ser 


val 


val 


cys 


thr 


thr 


ala 


ala 


gly 


241/81 


















271/91 
















AAT GTC 


AAC 


ATC 


GCG 


ATC 


GGC 


GGG 


GCG 


GCG 


ACC GGC 


ATT 


GCC 


GCC 


GTG 


CTC 


ACC 


GAC 


GGC 


asn val 


asn 


ile 


ala 


ile 


gly 


gly 


ala 


ala 


thr gly 


ile 


ala 


ala 


val 


leu 


thr 


asp 


gly 


301/101 


















331/111 


















AAC CCT 


CCG 


GAG 


GTG 


AAG 


TCC 


GTT 


GGG 


CTC 


GGT AAC 


GTC 


AAC 


GGC 


GTC 


ACG 


CTG 


GGA 


TAC 


asn pro 


pro 


glu 


val 


lys 


ser 


val 


gly 


leu 


gly asn 


val 


asn 


gly 


val 


thr 


leu 


gly 


tyr 


361/121 


















391/131 














ACG TCG 


GGC 


ACC 


GGA 


CAG 


GGT 


AAC 


GCC 


TCG 


GCA ACC 


AAG 


GAC 


GGC 


AGC 


CAC 


TAC 


AAG 


ATC 


thr ser 


gly 


thr 


gly 


gin 


gly 


asn 


ala 


ser 


ala thr 


lys 


asp 


gly 


ser 


his 


tyr 


lys 


ile) 



SEQ XD NOS: 516-518 

FIG. 24A 



^ SEQ ID NO: 519 32/11 

TAA CGA CAG GCA AAG GAG CAC AGG GTG AAG CGT GGA CTG ACG GTC GCG GTA GCC GGA GCC 

0CH(arg gin ala lys glu his arg val lys arg gly leu thr val ala val ala gly ala 

62/21 SEQ ID NO: 520 92/31 

GCC ATT CTG GTC GCA GGT CTT TCC GGA TGT TCA AGC AAC AAG TCG ACT ACA GGA AGC GGT 

ala ile leu val ala gly leu ser gly cys ser ser asn lys ser thr thr gly ser gly 
122/41 152/51 

GAG ACC ACG ACC GCG GCA GGC ACG ACG GCA AGC CCC GGC GCC GCT CCG GGC CGA AGG TCG 

glu thr thr thr ala ala gly thr thr ala ser pro gly ala ala pro gly arg arg ser 
182/61 212/71 

TCA TCG ACG GTA AGG ACC AGA ACG TCA CCG GCT CCG TGG TGT GCA CAA CCG CGG CCG GCA 

ser ser thr val arg thr arg thr ser pro ala pro trp cys ala gin pro arg pro ala 
242/81 272/91 

ATG TCA ACA TCG CGA TCG GCG GGG CGG CGA CCG GCA TTG CCG CCG TGC TCA CCG ACG GCA 

met ser thr ser arg ser ala gly arg arg pro ala leu pro pro cys ser pro thr ala 
302/101 332/111 

ACC CTC CGG AGG TGA AGT CCG TTG GGC TCG GTA ACG TCA ACG GCG TCA CGC TGG GAT ACA 

thr leu arg arg)OPA(ser pro leu gly ser val thr ser thr ala ser arg trp asp thr 

362/121 £ SEQ ID NO: 521 392/131 

CGT CGG GCA CCG GAC AGG GTA ACG CCT CGG CAA CCA AGG ACG GCA GCC ACT ACA AGA TC 

arg arg ala pro asp arg val thr pro arg gin pro arg thr ala ala thr thr arg) 



SEQ XD NOS: 519-521 



FIG. 24B 



MAR 2 6 2003 £ 



SEQ ID NO: 522 33/11 







CAA 


AGG 


AGC 


ACA 


GGG 


TGA 


AGC GTG GAC 


TGA 


CGG 


TCG 


CGG 


TAG 


CCG 


GAG 


CCG 


V dsn asp 


arg 


gin 


arg 


ser 


4- V» -v- 

cnr 


gly)OPA 


ser val asp 


OPA 


arg 


ser 


arg 


AMB (pro 


glu 


pro 




SEQ 


ZD NO: 523 








93/31 


SEQ 


ID NO: 524 












TGG 


TCG 


CAG 


GTC 


TTT 


CCG 


GAT 


GTT CAA GCA 


ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 


pi <-> plits 


trp 


ser 


gin 


val 


phe 


pro 


asp 


val gin ala 


thr 


ser 


arg 


leu 


gin 


glu 


ala 


val 


X^ ,3 / 41 X 
















153/51 


















CGA 


CCG 


CGG 


CAG 


GCA 


CGA 


CGG 


CAA GCC CCG 


GCG 


CCG 


CTC 


CGG 


GCC 


GAA 


GGT 


CGT 


arg pro 


arg 


pro 


arg 


gin 


ala 


arg 


arg 


gin ala pro 


ala 


pro 


leu 


arg 


ala 


glu 


gly 


arg 


IOj/ ox 
















213/71 




















CGG 


TAA 


GGA 


CCA 


GAA 


CGT 


CAC 


CGG CTC CGT 


GGT 


GTG 


CAC 


AAC 


CGC 


GGC 


CGG 


CAA 


his arg 


arg)OCH(gly 


pro 


glu 


arg 


his 




gly 


val 


his 


asn 


arg 


gly 


arg 


gin 


243/81 






SEQ ID NO: 


525 


273/91 


















TGT CAA 


CAT 


CGC 


GAT 


CGG 


CGG 


GGC 


GGC 


GAC CGG CAT 


TGC 


CGC 


CGT 


GCT 


CAC 


CGA 


CGG 


CAA 


cys gin 


his 


arg 


asp 


arg 


arg 


gly 


qly 


asp arg his 


cys 


arg 


arg 


ala 


his 


arg 


arg 


gin 


303/101 
















333/111 
















CCC TCC 


GGA 


GGT 


GAA 


GTC 


CGT 


TGG 


GCT 


CGG TAA CGT 


CAA 


CGG 


CGT 


CAC 


GCT 


GGG 


ATA 


CAC 


pro ser 


gly 


gly 


glu 


val 


arg 


trp 


ala 


arg)OCH(arg 


gin 


arg 


arg 


his 


ala 


gly 


ile 


his 


363/121 
















393/13T* 


SEQ ID NO 


: 526 








GTC GGG 


CAC 


CGG 


ACA 


GGG 


TAA 


CGC 


CTC 


GGC AAC CAA 


GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


C 


val gly 


his 


arg 


thr 


gly)OCH<arg 


leu 


gly asn gin 


gly 


arg 


gin 


pro 


leu 


gin 


asp) 





SEQ ID NO: 527 



SEQ XD NOS: 522-527 

FIG. 24C 



Direct primer 
SEQ ID NO: 528 5' ACG CGG CGC AGC CTG TTG 3' 



SEQ XD No. 528 

FIG. 25 



Reverse primer 
SEQ ID NO: 529 5* CGA CCT TGG GAT TCG CCT 3' 



SEQ XD No .529 



FIG. 26 




31/11 



r*c r T k arr 


AGC 


AAG 


AGC 


CCA 


GGG 


til 






ACL 1AA 


AAG 


GAG 


TAG 


CGC 


CCA 


TGG 


GCT 


TGA 


V ^-^-L w t_ J. i. X. 


ser 


lys 


ser 


pro 


gly 


1 eu 


hi s 


arg 


tinr ) ulh 


lys 


glu 


AMB(arg 


pro 


trp 


ala)OPA 


61 / 21 


SEQ ID NO: 


531 








y J- / J J. 








SEQ ID NO: 


532 




TTT 


CCT 


TCC 


GCC 


CCG 


1 vjL- 


A Am 
AA 1 


ALL 


ATL TGC 


AAG 


ACC 


AGC 


GAC 


GGC 


CCG 


TGG 


TTG 


t COT* a CiTt 
\ SCI OSll 


phe 


pro 


ser 


ala 


pro 


cys 


asn 




ile cys 


lys 


thr 


ser 


asp 


gly 


pro 


trp 


leu 


*T*5 1 / A 1 " * 
1Z1/ ft J. 


SEQ ID NO: 


533 












151/51 










LvjVj 1 vj 


LL?L 


AIjL- 


1 1 (j 


f-\ r~\ 

LCjCj 


AAA 




GGT 


ATG 


GAC CCT 


GCC 


GTA 


CCG 


TTG 


TTG 


CCA 


CTT 


GAT 


arg Ser 


arg 


ser 


leu 


arg 


lys 


arg 


giy 


met 


asp pro 


ala 


val 


pro 


leu 


leu 


pro 


leu 


asp 


1 O 1 /CI 

lo 1 / 0 1 


















211/71 




















GCT 


CTC 


CAC 


CCG 


TCG 


GGG 


GGC 


GAA 


AGC CAT 


TCC 


GAC 


ACT 


GGG 


ATC 


CTC 


AAA 


ACG 


va 1 va 1 


ala 


leu 


his 


pro 


ser 


gly 


gly 


giu 


ser his 


ser 


asp 


thr 


gly 


ile 


leu 


lys 


thr 


241/81 


















271/91 




















GAG 


TGT 


CTG 


CAG 


GGC 


TCC 


GGG 


GAG 


CAG CCG 


ATC 


ATC 


ACC 


ATG 


TAC 


GAA 


CTG 


AAT 


ser ala 


glu 


cys 


leu 


gin 


gly 


ser 


gly 


glu 


gin pro 


ile 


ile 


thr 


met 


tyr 


glu 


leu 


asn 


301/101 


















331/111 


















AAG TCC 


ccc 


CCG 


CGC 


GAC 


TTC 


CAG 


ACA 


TTT 


GTT GTG 


GTT 


TCG 


GTT 


GAG 


GCC 


GAG 


GCG 


AGG 


lys ser 


pro 


pro 


arg 


asp 


phe 


gin 


thr 


phe 


val val 


val 


ser 


val 


glu 


ala 


glu 


ala 


arg 


361/121 


















391/131 


















CTC ATT 


TCG 


CAG 


CAA 


GCG 


GTC 


TCC 


GGG 


TCG 


CAG CAT 


CGT 


TGC 


GGC 


GAT 


CGC 


GGC 


GCA 


GTC 


leu ile 


ser 


gin 


gin 


ala 


val 


ser 


gly. 


ser 


gin his 


arg 


cys 


gly 


asp 


arg 


gly 


ala 


val 


421/141 




































GTC GGA 


CGA 


GTC 


GTC 


GTC 


AAC 


GAC 


CAC 


GAT 


C 


















val gly 


arg 


val 


val 


val 


asn 


asp 


his 


asp) 



















SEQ ID NOS: 530-533 

FIG. 27A 



-SEQ ID 


NO: 


534 












31/11 


















CTA CCA 


GCA 


AGA 


GCC 


CAG 


GGC 


TTC 


ACA 


GGA 


CCT AAA 


AGG 


AGT 


AGC 


GCC 


CAT 


GGG 


CTT 


GAT 


(leu pro 


ala 


arg 


ala 


gin 


gly 


phe 


thr 


giy 


pro lys 


arg 


ser 


ser 


ala 


his 


giy 


leu 


asp 


61/2l>— 


SEQ 


ID NO: 535 










91/31 


















CCA ATT 


TTC 


CTT 


CCG 


CCC 


CGT 


GCA 


ATA 


CCA 


TCT GCA 


AGA 


CCA 


GCG 


ACG 


GCC 


CGT 


GGT 


TGC 


pro ile 


phe 


leu 


pro 


pro 


arg 


ala 


ile 


pro 


ser ala 


arg 


pro 


ala 


thr 


ala 


arg 


gly 


cys 


121/41 


















151/51 


















GGT CGC 


GCA 


GCT 


TGC 


GGA 


AAC 


GGG 


GTA 


TGG 


ACC CTG 


CCG 


TAC 


CGT 


TGT 


TGC 


CAC 


TTG 


ATG 


gly arg 


ala 


ala 


cys 


gly 


asn 


gly 


val 


trp 


thr leu 


pro 


tyr 


arg 


cys 


cys 


his 


leu 


met 


181/61 


















211/71 


















TCG TCG 


CTC 


TCC 


ACC 


CGT 


CGG 


GGG 


GCG 


AAA 


GCC ATT 


CCG 


ACA 


CTG 


GGA 


TCC 


TCA 


AAA 


CGT 


ser ser 


leu 


ser 


thr 


arg 


arg 


giy 


ala 


lys 


ala ile 


pro 


thr 


leu 


gly 


ser 


ser 


lys 


arg 


241/81 


















271/91 


















CGG CTG 


AGT 


GTC 


TGC 


AGG 


GCT 


CCG 


GGG 


AGC 


AGC CGA 


TCA 


TCA 


CCA 


TGT 


ACG 


AAC 


TGA 


ATA 


arg leu 


ser 


val 


cys 


arg 


ala 


pro 


gly 


ser 


ser arg 


ser 


ser 


pro 


cys 


thr 


asn)OPA( ile 


301/101 


















331/111 






SEQ ID NO: 


536- — 




AGT CCC 


CCC 


CGC 


GCG 


ACT 


TCC 


AGA 


CAT 


TTG 


TTG TGG 


TTT 


CGG 


TTG 


AGG 


CCG 


AGG 


CGA 


GGC 


ser pro 


pro 


arg 


ala 


thr 


ser 


arg 


his 


leu 


leu trp 


phe 


arg 


leu 


arg 


pro 


arg 


arg 


giy 


361/121 


















391/131 


















TCA TTT 


CGC 


AGC 


AAG 


CGG 


TCT 


CCG 


GGT 


CGC 


AGC ATC 


GTT 


GCG 


GCG 


ATC 


GCG 


GCG 


CAG 


TCG 


ser phe 


arg 


ser 


lys 


arg 


ser 


pro 


gly 


arg 


ser ile 


val 


ala 


ala 


ile 


ala 


ala 


gin 


ser 


421/141 




































TCG GAC 


GAG 


TCG 


TCG 


TCA 


ACG 


ACC 


ACG 


ATC 




















ser asp 


glu 


ser 


ser 


ser 


thr 


thr 


thr 


ile) 





















SEQ ID NOS: 534-536 



FIG. 27B 



SEQ 


ID NO: 537 












-1-3/11 


















rpar par; 


CAA 


GAG 


CCC 


AGG 


GCT 


1 K~r\ 






AAA 




Cj 1 A 


GCG 


CCC 


ATG 


GGC 


TTG 


ATC 




gin 


glu 


pro 


arg 


ala 


ser 


gin 


asp 


leu xys 


gly 


vax 


ala 


pro 


met 


gly 


leu 


ile 




SEQ ID NO : 


538 


























t_AA ill 


TCC 


TTC 


CGC 


CCC 


GTG 


LAA 


1 AC- 


CAT 


L1G CAA 


GAC 


CAG 


CGA 


CGG 


CCC 


GTG 


GTT 


GCG 


gin phe 


ser 


phe 


arg 


pro 


val 


gin 


tyr 


nis 


leu gin 


asp 


gin 


arg 


arg 


pro 


val 


val 


ala 


123/41 


















1 c: ~i /ci 


















GTC GCG 


CAG 


CTT 


GCG 


GAA 


ACG 


/~i /'-l /-i 


TAT 


CjGA 


CCC TGC 


CGT 


ACC 


GTT 


GTT 


GCC 


ACT 


TGA 


TGT 


val ala 


gin 


leu 


ala 


glu 


thr 




tyr 


gly 


pro cys 


arg 


unr 


val 


val 


ala 


thr ) OPA(cys 


183/61 


















O 1 / "7 1 






SEQ ID NO: 


539-—^ 




CGT CGC 


TCT 


CCA 


CCC 


GTC 


GGG 






PU\Kj 


Ct-A T.TC 


CGA 


CAC 


TGG 


GAT 


CCT 


CAA 


AAC 


GTC 


arg arg 


ser 


pro 


pro 


val 


gly 




arg 


lys 


pro phe 


arg 


his 


trp 


asp 


pro 


gin 


asn 


val 


243/81 


















OT\ / Q1 
*5 / J / si 


















GGC TGA 


GTG 


TCT 


GCA 


GGG 


CTC 






VjLA 


VjC-L. OA I 


CAT 


CAC 


CAT 


GTA 


CGA 


ACT 


GAA 


TAA 


gly)OPA(val 


ser 


ala 


gly 


leu 


arg 


gly 


d Xd 


ala asp 


hi s 


his 


his 


val 


arg 


thr 


glu)OCH 


303/10r* 


" SEQ ID NO: 540 








•JT-J / I 1 1 
J JJ/ 111 


















GTC CCC 


CCC 


GCG 


CGA 


CTT 


CCA 


GAC 


ATT 


TGT 


TGT GGT 


TTC 


GGT 


TGA 


GGC 


CGA 


GGC 


GAG 


GCT 


(val pro 


pro 


ala 


arg 


leu 


pro 


asp 


ile 


cys 


cys gly 


phe 


gly)OPA(gly 


arg 


gly 


glu 


ala 


363/121 


SEQ ID 


NO: 


541 








393/131 










SEQ 


ID NO: 542 


CAT TTC 


GCA 


GCA 


AGC 


GGT 


CTC 


CGG 


GTC 


GCA 


GCA TCG 


TTG 


CGG 


CGA 


TCG 


CGG 


CGC 


AGT 


CGT 


his phe 


ala 


ala 


ser 


gly 


leu 


arg 


val 


ala 


ala ser 


leu 


arg 


arg 


ser 


arg 


arg 


ser 


arg 


423/141 




































CGG ACG 


AGT 


CGT 


CGT 


CAA 


CGA 


CCA 


CGA 


TC 




















arg thr 


ser 


arg 


arg 


gin 


arg 


pro 


arg) 





















SEQ ID NOS: 537-541 

FIG. 27C 



SEQ ID NO: 543 MKTGT ATTRRRLL AVL I AL AL PG AAVAL L AE P S ATG AS D PC AAS EVAR 
TVG S VAK SMGD YL DSH PETNQ VMT AVLQ QQ VG PG S VA S L KAH F E AN P K 
VASDLHALSQPLTDLSTRCSLPISGLQAIGLMQAVQGARR 



SEQ ID No .543 



FIG. 28 



y-SEQ ID NO: 544 

GTGGGCAAGC AGCTAGCCGC GCTCGCCGCG CTGGTCGGTG CGTGCATGCT CGCAGCCGGA 60 

TGCACCAACG TGGTCGACGG GACCGCCGTG GCTGCCGACA AATCCGGACC ACTGCATCAG 120 

GATCCGATAC CGGTTTCAGC GCTTGAAGGG CTGCTTCTCG ACTTGAGCCA GATCAATGCC 180 

GCGCTGGGTG CGACATCGAT GAAGGTGTGG TTCAACGCCA AGGCAATGTG GGACTGGAGC 240 

AAGAGCGTGG CCGACAAGAA TTGCCTGGCT ATCGACGGTC CAGCACAGGA AAAGGTCTAT 3 00 

GCCGGCACCG GGTGGACCGC TATGCGCGGC CAACGGCTGG ATGACAGCAT CGATG AC TCC 3 60 

AAGAAACGCG ACCACTACGC CATTCAAGCG GTCGTCGGCT TCCCGACCGC ACATGATGCC 42 0 

GAGGAGTTCT ACAGCTCCTC GGTGCAAAGC TGGAGCAGCT GCTCGAACCG CCGGTTTGTC 480 

GAAGTCACCC CCGGACAGGA CGACGCCGCC TGGACTGTGG CTGACGTTGT CAACGACAAC 540 

GGCATGCTCA GTAGCTCGCA GGTTCAGGAA GGCGGCGACG GATGGACCTG CCAGCGTGCC 600 

CTGACTGCGC GCAACAACGT CACTATCGAC ATTGTCACGT GCGCCTATAG CCAACCGGAT 660 

TTGGTGGCGA TTGGCATCGC TAACCAAATC GCGGCCAAGG TTGCTAAGCA GTAG 714 



SEQ ID No. 544 



FIG. 29 




SEQ ZD NO: 545 



MGKQLAALAALVGACMLAAGCTNWDGTAVAADKSGPLHQDPIPVFTSALEGLLLDLSQINAALGATS 
MKVWFNAKAMWDWS K S VADKNC LA I DG PAQEKVYAGTG FTWTAMRGQRLDDS I DDS KKRDHYA I Q A W 
GF PTAHDAEEF Y S S S VQ S WS SC SNRRFVEVTFT PGQDDAAWTVADWNDNGML S S SQVQEGGDG WTCQ 
RALTARNNVTIDIVTCAYSQPDLVFTAIGIANQIAAKVAKQ 



SEQ ID No. 545 

FIG. 30 

•SEQ ID NO: 546 31/11 
AGG CGA ATA CCC GCG AGG GCA GCG CGA CGG CGG CCC TGC CGG CGC CGT GGC TGC TGA ACA 
(arg arg ile pro ala arg ala ala arg arg arg pro cys arg arg arg gly cys)OPA(thr 

61/21 SEQ ID NO: 547 91/31 SEQ ID NO: 548 

ACA CAT CCC AGC CGC GCA CGC TTC CGG TAT GCG GCA GGA TAA ACG ACC CCA ACA GCA CGA 
thr his pro ser arg ala arg phe arg tyr ala ala gly)OCH(thr thr pro thr ala arg 

121/41 151/51 t SEQ ID NO: 549 

ACA CCA GGA TTG CGA CAA CCA AAG CCC TCG CGC CTG GCT CGA TTT CGC GCG CAA CGC GGC 

thr pro gly leu arg gin pro lys pro ser arg leu ala arg phe arg ala gin arg gly 

181/61 211/71 

GTT CTG CCG CCT CGA TCT CAG CGC GGA GGG CGT CGA GAT C 

val leu pro pro arg ser gin arg gly gly arg arg asp) 

SEQ XD NOS: 546-549 

FIG. 31A 



1/1 ^SEQ ID NO 


: 550 










GGC GAA 


TAC 


CCG 


CGA 


GGG 


CAG 


CGC 


GAC 


GGC 


(gly giu 


tyr 


pro 


arg 


gly 


gin 


arg 


asp 


gly 


61/21^- 


SEQ 


ID NO: 551 










CAC ATC 


CCA 


GCC 


GCG 


CAC 


GCT 


TCC 


GGT 


ATG 


his ile 


pro 


ala 


ala 


his 


ala 


ser 


gly 


met 


121/41 


















CAC CAG 


GAT 


TGC 


GAC 


AAC 


CAA 


AGC 


CCT 


CGC 


his gin 


asp 


cys 


asp 


asn 


gin 


ser 


pro 


arg 


181/61 


















TTC TGC 


CGC 


CTC 


GAT 


CTC 


AGC 


GCG 


GAG 


GGC 


phe cys 


arg 


leu 


asp 


leu 


ser 


ala 


glu 


gly 



31/11 

GGC CCT GCC GGC GCC GTG GCT GCT GAA CAA 
gly pro ala gly ala val ala ala glu gin 
91/31 

CGG CAG GAT AAA CGA CCC CAA CAG CAC GAA 
arg gin asp lys arg pro gin gin his glu 
151/51 

GCC TGG CTC GAT TTC GCG CGC AAC GCG GCG 

ala trp leu asp phe ala arg asn ala ala 

211/71 

GTC GAG ATC 

val glu ile) 



SEQ XD NOS: 550-551 



FIG. 31 B 



MAR 2 6 2003 SI 




1/1 


-SEQ 


ID NO: 552 










GCG AAT 


ACC 


CGC 


GAG 


GGC 


AGC 


GCG 


ACG 


GCG 


(ala asn 


thr 


arg 


glu 


gly 


ser 


ala 


thr 


ala 


61/21^- 


-SEQ 


ID NO: 553 










ACA TCC 


CAG 


CCG 


CGC 


ACG 


CTT 


CCG 


GTA 


TGC 


thr ser 


gin 


pro 


arg 


thr 


leu 


pro 


val 


cys 


121/41 
















ACC AGG 


ATT 


GCG 


ACA 


ACC 


AAA 


GCC 


CTC 


GCG 


thr arg 


ile 


ala 


thr 


"thr 


lys 


ala 


leu 


ala 


181/61 


















TCT GCC 


GCC 


TCG 


ATC 


TCA 


GCG 


CGG 


AGG 


GCG 


ser ala 


ala 


ser 


ile 


ser 


ala 


arg 


arg 


ala 



31/11 

GCC CTG CCG GCG CCG TGG CTG CTG AAC AAC 
ala leu pro ala pro trp leu leu asn asn 
91/31 

GGC AGG ATA AAC GAC CCC AAC AGC ACG AAC 
gly arg ile asn asp pro asn ser thr asn 
151/51 

CCT GGC TCG ATT TCG CGC GCA ACG CGG CGT 
pro gly ser ile ser arg ala thr arg arg 
211/71 
TCG AGA TC 
ser arg) 



SEQ ID NOS: 552-553 

FIG. 31 C 



ORF according to Cole et al 
1/1 SEQ ID NO: 554 



taa acg 


acc 


cca 


aca 


gca 


cga 


OCH(£hr 


thr 


pro 


thr 


ala 


arg 


61/21"" 


SEQ ID 


NO: 


555 




cga ttt 


cgc 


gcg 


caa 


cgc 


ggc 


arg phe 


arg 


ala 


gin 


arg 


gly 


121/41 












ccc egg 


cgt 


cgt 


gtt 


cgt 


ggc 


pro arg 


arg 


arg 


val 


arg 


gly 


181/61 












cag ccc 


gac 


ccc 


agg 


cat 


gee 


gin pro 


asp 


pro 


arg 


his 


ala 


241/81 












gtc gec 


ggc 


gcg 


ggt 


gcg 


gcg 


val ala 


gly 


ala 


gly 


ala 


ala 


301/101 












cgc cgc 


ttc 


ggt 


gac 


ctt 


cgt 


arg arg 


phe 


gly 


asp 


leu 


arg 


361/121 












ggt gaa 


gtg 


cac 


cag 


gcg 


ccc 


gly glu 


val 


his 


gin 


ala 


pro 


421/141 












ctt gcg 


ccc 


ttc 


ccc 


ggt 


ggc 


leu ala 


pro 


phe 


pro 


gly 


gly 


481/161 












ggc ttc 


cag 


cga 


gat 


ttg 


etc 


gly phe 


gin 


arg 


asp 


leu 


leu 


541/181 












ggc ttt 


egg 


cag 


ctg 


tec 


gtc 


gly phe 


arg 


gin 


leu 


ser 


val 


601/201 












gaa ggt 


aaa 


tgc 


ggc 


cgc 


gaa 


glu gly 


lys 


cys 


gly 


arg 


glu 


661/221 












etc ttc 


ggt 


etc 


ccc 


ggg 


gaa 


leu phe 


gly 


leu 


pro 


gly 


glu 


721/241 












ggc cgc 


ccg 


cac 


gcg 


etc 


gat 


gly arg 


pro 


his 


ala 


leu 


asp 


781/261 












cgc gcg 


cag 


gat 


ccg 


gtc 


gga 


arg ala 


gin 


asp 


pro . 


val 


gly 



. (Nature 393:537-544) 
31/11 



aca 


cca 


yy a 


ttg cga 


caa 


thr 


pro 


y -*-y 


leu arg 


gin 


gtt 


ctg 


ccg 


ect cga 


tct 


val 


leu 


pro 




ser 








151/51 




tea 


tea 


tct 


gca tee 


tec 


ser 


ser 


ser 


ala ser 


ser 








211/71 




cag 


CTCC 


gac 


y y *— y^-y 


cec 


gin 


ala 


asp 


gly ala 


pro 








271/91 




gtg 


ggt 


cag 


gac gee 


ggc 


val 


gly 


gin 


asp ala 


gly 








331/111 




ggt 


gat 


gac 


gtc gee 


ggg 


gly 


asp 


asp 


val ala 


gly 








391/131 




gtc 


gcg 


cgc 


ccg ccc 


get 


val 


ala 


arg 


pro pro 


ala 








451/151 




cac 


cag 


cac 


etc gac 


ggc 


his 


gin 


his 


leu asp 


gly 








511/171 




ctg 


cag 


cgc 


gat cag 


gcg 


leu 


gin 


arg 


asp gin 


ala 








571/191 




gag 


ttg 


cgc 


ggc egg 


tgt 


glu 


leu 


arg 


gly arg 


cys 








631/211 




gcg 


ggc 


ccg 


gcg cac 


cac 


ala 


gly 


pro 


ala his 


his 








691/231 




acc 


gac 


gat 


cag ate 


ggt 


thr 


asp 


asp 


gin ile 


gly 








751/251 




gat 


gee 


gag 


gta gcg 


etc 


asp 


ala 


glu 


val ala 


leu 








811/271 




tec 


gga 


ctg 


tag 




ser 


gly 


leu) AMB 





id containing seq31A 

cca aag ccc teg cgc ctg get 
pro lys pro ser arg leu ala 
91/31 

cag cgc gga ggg cgt cga gat 
gin arg gly gly arg arg asp 

ggg ctt ggc cgc get gac egg 
gly leu gly arg ala asp arg 

egg ctg ccc ggc ggt gtg cgc 
arg leu pro gly gly val arg 

gtc ggc gat gag gtg gtg egg 
val gly asp glu val val arg 

acg cac gcg egg ctg gee ggc 
thr his ala arg leu ala gly 

cat gcg cgc cgt gac ggt gtc 
his ala arg arg asp gly val 

ctg ccc gac cag ggc gcg gtt 
leu pro asp gin gly ala val 

ttc ata gcg ttc ctg cac aac 
phe ile ala phe leu his asn 

ccc ggg ccg ctt gga gta ttg 
pro gly pro leu gly val leu 

gtc gag cgt ggc cgc gaa gtc 
val glu arg gly arg glu val 

ggt aat cgc ggc atg egg gat 
gly asn arg gly met arg asp 

ggc acg ata gga ccg ccg cat 
gly thr ile gly pro pro his 



SEQ ID NOS: 554-555 



FIG. 31 D 




\ D££L— ''SEQ ID NO: 556 

aga ctg gtg tac acg gag 
( arg l eu val tyr thr glu 
61/21^SEQ ID NO: 557 
eta gtg aag gtc att cag 
leu val lys val ile gin 
121/41 

ttc gtc ggt ttc gcg egg 
phe val gly phe ala arg 
181/61 

ccg ttg egg gac tac gaa 
pro leu arg asp tyr glu 
241/81 

ggg ccg acc gcg ctg aac 
gly pro thr ala leu asn 
301/101 

tac ggg cgc ate aaa cgc 
tyr gly arg lie lys arg 



31/11 



acc 


aag 


ctg 


aac 


teg gca 


ttc 


tnr 


lys 


leu 


asn 


ser ala 


phe 










91/31 




aaa 


ctg 


teg 


ggc 


ttg age 


ate 


lys 


leu 


ser 


gly 


leu ser 


ile 










151/51 




atg 


gtc 


gag 


gee 


etc ggc 


ggc 


met 


val 


glu 


ala 


leu gly 


gly 










211/71 




ctg 


ggc 


acg 


gtg 


ctg gag 


cac 


leu 


gly 


thr 


val 


leu glu 


his 










271/91 




tat 


gtg 


cgc 


get 


cgc cag 


gtc 


tyr 


val 


arg 


ala 


arg gin 


val 










331/111 




cag 


cag 


ttg 


ttt 


ttg teg 


teg 


gin 


gin 


leu 


phe 


leu ser 


ser 



SEQ ID NOS: 556-557 



tec ttc ggc ggg cct aag tgt 
ser phe gly gly pro lys cys 

aac egg ttc ate gcg att gac 
asn arg phe ile ala ile asp 

gtc gag gta tgc age acc acc 
val glu val cys ser thr thr 

gee gga cgc cag gtc att gac 
ala gly arg gin val ile asp 

acc acc gag age aat ggc gac 
thr thr glu ser asn gly asp 

ctg ctg cgt teg atg ate 
leu leu arg ser met ile) 



FIG. 32A 



1/l^SEQ ID NO: 558 



gac tgg tgt aca egg aga cca age tga act 
(asp t rp cys thr arg arg pro ser)OPA(thr 
61/21 SEQ ID NO: 559 SEQ ID NO: 560 



tag tga 


agg 


tea 


ttc 


aga 


aac 


tgt 


egg 


get 


AMB OPA(arg 


ser 


phe 


arg 


asn 


cys 


arg 


ala 


121/41 * 


SEQ ID NO 


: 561 








teg teg 


gtt 


teg 


cgc 


gga 


tgg 


teg 


agg 


ccc 


ser ser 


val 


ser 


arg 


gly 


trp 


ser 


arg 


pro 


181/61 


















cgt tgc 


ggg 


act 


acg 


aac 


tgg 


gca 


egg 


tgc 


arg cys 


gly 


thr 


thr 


asn 


trp 


ala 


arg 


cys 


241/81 


















ggc cga 


ccg 


cgc 


tga 


act 


atg 


tgc 


gcg 


etc 


gly arg 


pro 


arg)OPA(thr 


met 


cys 


ala 


leu 


301/101 








SEQ ID NO: 


563 


acg ggc 


gca 


tea 


aac 


gee 


age 


agt 


tgt 


ttt 


thr gly 


ala 


ser 


asn 


ala 


ser 


ser 


cys 


phe 



31/11 


















egg cat 


tct 


cct 


teg 


gcg 


ggc 


eta 


agt 


gtc 


arg his 


ser 


pro 


ser 


ala 


gly 


leu 


ser 


val) 


91/31 


















tga gca 


tea 


acc 


ggt 


tea 


teg 


cga 


ttg 


act 


OPA(ala 


ser 


thr 


gly 


ser 


ser 


arg 


leu 


thr 


15l751^" 


SEQ ID 


NO: 


562 










tcg gcg 


gcg 


teg 


agg 


tat 


gca 


gca 


cca 


ccc 


ser ala 


ala 


ser 


arg 


tyr 


ala 


ala 


pro 


pro 


211/71 


















tgg age 


acg 


ccg 


gac 


gee 


agg 


tea 


ttg 


acg 


trp ser 


thr 


pro 


asp 


ala 


arg 


ser 


leu 


thr 


271/91 


















gee agg 


tea 


cca 


ccg 


aga 


gca 


atg 


gcg 


act 


ala arg 


ser 


pro 


pro 


arg 


ala 


met 


ala 


thr 


331/111 


















tgt cgt 


cgc 


tgc 


tgc 


gtt 


cga 


tga 


tc 




cys arg 


arg 


cys 


cys 


val 


arg)OPA 







SEQ ID NOS: 558-563 



FIG. 32B 




31/11 



act ggt 


gta 


cac 


gga 


gac 


caa 


get 


gaa 


etc 


ggc att 


etc 


ctt 


egg 


egg 


gcc 


taa gtg 


tct 


(thr gly 


val 


his 


gly 


asp 


gin 


ala 


glu 


leu 


gly ile 


leu 


leu 


arg 


arg 


ala)OCH(val 


ser 


61/21 SEQ ID NO: 565 










91/31 




SEQ ID NO: 566-^* 




agt gaa 


ggt 


cat 


tea 


gaa 


act 


gtc 


ggg 


ctt 


gag cat 


caa 


ccg 


gtt 


cat 


cgc 


gat tga 


ctt 


ser glu 


gly 


his 


ser 


glu 


thr 


val 


gly 


leu 


glu his 


gin 


pro 


val 


his 


arg 


asp)OP(leu 


121/41 


















151/51 






SEQ ID NO 


: 567--^ 




cgt egg 


ttt 


cgc 


gcg 


gat 


ggt 


cga 


ggc 


cct 


egg egg 


cgt 


cga 


ggt 


atg 


cag 


cac cac 


ccc 


arg arg 


phe 


arg 


ala 


asp 


gly 


arg 


gly 


pro 


arg arg 


arg 


arg 


gly 


met 


gin 


his his 


pro 


181/61 


















211/71 












gtt gcg 


gga 


eta 


cga 


act 


ggg 


cac 


ggt 


get 


gga gca 


cgc 


egg 


acg 


cca 


ggt 


cat tga 


egg 


val ala 


gly 


leu 


arg 


thr 


gly 


his 


gly 


ala 


gly ala 


arg 


arg 


thr 


pro 


gly 


his)OPA(arg 


241/81 


















271/91 






SEQ ID NO 


: 568 ^ 




gcc gac 


cgc 


get 


gaa 


eta 


tgt 


gcg 


cgc 


teg 


cca ggt 


cac 


cac 


cga 


gag 


caa 


tgg cga 


eta 


ala asp 


arg 


ala 


glu 


leu 


cys 


ala 


arg 


ser 


pro gly 


his 


his 


arg 


glu 


gin 


trp arg 


leu 


301/101 


















331/111 












egg gcg 


cat 


caa 


acg 


cca 


gca 


gtt 


gtt 


ttt 


gtc gtc 


get 


get 


gcg 


ttc 


gat 


gat c 




arg ala 


his 


gin 


thr 


pro 


ala 


val 


val 


phe 


val val 


ala 


ala 


ala 


phe 


asp 


asp) 





SEQ ID NOS: 564-568 



FIG. 32C 

sequence Rv0822c predicted by Cole et al . (Nature 393:537-544) and containing 
seg 32A 



1/l^SEQ ID NO: 569 31/11 



atg agt 
(Met ser 
61/21^- 


gac 
asp 
-SEQ 


ggc gag age 
gly glu ser 
ID NO: 570 


gcc 
ala 


gcg 
ala 


ccg 
pro 


tgg 
trp 


gca egg 
ala arg 
91/31 


etc 
leu 


tec 
ser 


gag 
glu 


tea 
ser 


gca 
ala 


ttc 
phe 


ccc 
pro 


gat 
asp 


ggt gtt 
gly val 
121/41 


gac 
asp 


cga 
arg 


tgg 
trp 


ate 
ile 


acg 
thr 


gta 
val 


ccg 
pro 


ccc 
pro 


gcc aca 
ala thr 
151/51 


tgg 
trp 


gtg 

val 


gca 
ala 


gcc 
ala 


cag 
gin 


ggt 
gly 


ccg 
pro 


egg 
arg 


gac ace 
asp thr 
181/61 


cag 
gin 


aat 
asn 


gtc 
val 


ggc 
gly 


tgt 
cys 


cat 
his 


gcc 
ala 


acc 
thr 


ggc gcc 
gly ala 
211/71 


gtt 
val 


agt 
ser 


gtg 
val 


gcc 
ala 


gat 
asp 


ctg 
leu 


ate 
ile 


gcc 
ala 


agg etc 
arg leu 
241/81 


ggc 

gly 


ccc 
pro 


get 
ala 


ttt 
phe 


cct 
pro 


gac 
asp 


etc 
leu 


ccc 
pro 


acg cac 
thr his 
271/91 


cgc 
arg 


cat 
his 


gtc 
val 


gcc 
ala 


ccc 
pro 


gaa 
glu 


ccc 
pro 


gag 
glu 


cca tec 
pro ser 
301/101 


ggc 
gly 


cgc 
arg 


ggc 
gly 


ccg 
pro 


aag 
lys 


gtc 
val 


cac 
his 


gac 
asp 


gac gcc 
asp ala 
331/111 


gac 
asp 


gac 
asp 


cag 
gin 


cag 
gin 


gac 
asp 


acc 
thr 


gag 
glu 


get 
ala 


ate gcc 
ile ala 
361/121 


ate 
ile 


ccg 
pro 


gcc 
ala 


cac 
his 


teg 
ser 


etc 
leu 


gag 
glu 


ttc 
phe 


etc teg 
leu ser 
391/131 


gag 
glu 


ctt 
leu 


ccc 
pro 


gac 
asp 


etc 
leu 


egg 
arg 


gca 
ala 


gcc 
ala 


aac tat 
asn tyr 
421/141 


ccg 
pro 


cgc 
arg 


gcc 
ala 


gac 
asp 


cac 
his 


gcc 
ala 


cgc 
arg 


cgt 
arg 


gaa ccc 
glu pro 
451/151 


gag 
glu 


eta 
leu 


ccc 
pro 


ggc 
gly 


aag 
lys 


cag 
gin 


eta 
leu 


acc 
thr 


gga teg 
gly ser 
481/161 


get 
ala 


cga 
arg 


gtg 
val 


egg 
arg 


cca 
pro 


ttg 
leu 


egg 
arg 


ate 
ile 


cgc cga 
arg arg 
511/171 


acg 
thr 


teg 
ser 


ccc 
pro 


gcg 
ala 


ccc 
pro 


gcc 
ala 


aag 
lys 


cca 
pro 


gcg ccg 
ala pro 
541/181 


aac 
asn 


tec 
ser 


ggc 
gly 


egg 
arg 


cgc 
arg 


ccg 
pro 


atg 
met 


gtg 

val 


ctg gcc 
leu ala 
571/191 


gcg 
ala 


cgc 
arg 


teg 
ser 


ctg 
leu 


gcg 
ala 


get 
ala 


ctg 
leu 


ttt 
phe 


gcc get 
ala ala 
601/201 


ctg 
leu 


gcg 
ala 


ttg 
leu 


gcg 
ala 


ctg 
leu 


ace 
thr 


ggc 
gly 


ggg 
gly 


gca tgg 
ala trp 
631/211 


cag 
gin 


tgg 
trp 


age 
ser 


gcg 

ala 


teg 
ser 


aag 
lys 


aac 
asn 


age 
ser 


egg ctg 
arg leu 


aac 
asn 


atg 
met 


gta 
val 


age 
ser 


gcg 

ala 


etc 
leu 


gac 
asp 


ccg 
pro 


cat teg 
his ser 


ggc 
gly 


gac 
asp 


ate 
ile 


gtc 
val 


aac 
asn 


ccc 
pro 


age 
ser 


ggg 
gly 



SEQ XD NOS: 569-570 



FIG. 32D 



aw a 

661/221 

cag cat ggc gac gag aac ttc ttg etc gtc 
gin his gly asp glu asn phe leu leu val 
721/241 

aat ate ggc gec ggc gac gec gag gac gec 
asn ile gly ala gly asp ala glu asp ala 
781/261 

gtc aac att ccg gec age cgc gag egg gtc 
val asn ile pro ala ser arg glu arg val 
841/281 

ate act cca ate caa tgc gag gcg tgg aac 
ile thr pro ile gin cys glu ala trp asn 
901/301 

gac gag aag acg gga acg atg ggt ccc aga 
asp glu lys thr gly thr met gly pro arg 
961/321 

gca ttc tec ttc ggc ggg cct aag tgt eta 
ala phe ser phe gly gly pro lys cys leu 
1021/341 

age ate aac egg ttc ate gcg att gac ttc 
ser ile asn arg phe ile ala ile asp phe 
1081/361 

ggc ggc gtc gag gta tgc age acc acc ccg 
gly gly val glu val cys ser thr thr pro 
1141/381 

gag cac gec gga cgc cag gtc att gac ggg 
glu his ala gly arg gin val ile asp gly 
1201/401 

cag gtc acc acc gag age aat ggc gac tac 
gin val thr thr glu ser asn gly asp tyr 
1261/421 

teg teg ctg ctg cgt teg atg ate teg acg 
ser ser leu leu arg ser met ile ser thr 
1321/441 

aac gtc gtc aac atg ttc ate ggt aac age 
asn val val asn met phe ile gly asn ser 
1381/461 

gtc gaa etc ggt cga teg ttg cag cat atg 
val glu leu gly arg ser leu gin his met 
1441/481 

ccg acc ggt ata acc gac cag aac ggc gac 
pro thr gly ile thr asp gin asn gly asp 
1501/501 

ctt ttc acc gee ate ate gac gac gat ccg 
leu phe thr ala ile . ile asp asp asp pro 
1561/521 

cgt ctg ggc aac acg ccg teg acc ccg ccg 
arg leu gly asn thr pro ser thr pro pro 
1621/541 

ctg acc aac gag att cag cac cag cag gtt 
leu thr asn glu ile gin his gin gin val 
1681/561 

cag gtc tct aac teg acc ggc cag gee ggt 
gin val ser asn ser thr gly gin ala gly 
1741/581 

egg aac ggc ttc aac gtg atg get ccg gac 
arg asn gly phe asn val met ala pro asp 
1801/601 

gtg ttt ttt teg ccc ggc aac gaa cag get 
val phe phe ser pro gly asn glu gin ala 
1861/621 

tea aag ate gag egg gtg acc ggg ate ggc 
ser lys ile glu arg val thr gly ile gly 



691/231 

ggt atg gac tct cgt gee ggg gcg aac gec 
gly met asp ser arg ala gly ala asn ala 
751/251 

ggc ggc gca cgt teg gac acc gtc atg ctg 
gly gly ala arg ser asp thr val met leu 
811/271 

gtc gcg gtg teg ttc ccc cgc gac ctg gcg 
val ala val ser phe pro arg asp leu ala 
871/291 

ccc gag acc ggt aag tac gga ccc ate tac 
pro glu thr gly lys tyr gly pro ile tyr 
931/311 

ctg gtg tac acg gag acc aag ctg aac teg 
leu val tyr thr glu thr lys leu asn ser 
991/331 

gtg aag gtc att cag aaa ctg teg ggc ttg 
val lys val ile gin lys leu ser gly leu 
1051/351 

gtc ggt ttc gcg egg atg gtc gag gee etc 
val gly phe ala sarg met val glu ala leu 
1111/371 

ttg egg gac tac gaa ctg ggc acg gtg ctg 
leu arg asp tyr glu leu gly thr val leu 
1171/391 

ccg acc gcg ctg aac tat gtg cgc get cgc 
pro thr ala leu asn tyr val arg ala arg 
1231/411 

ggg cgc ate aaa cgc cag cag ttg ttt ttg 
gly arg ile lys arg gin gin leu phe leu 
1291/431 

gac acc ttg ttc aac etc age agg etc aac 
asp thr leu phe asn leu ser arg leu asn 
1351/451 

tac gtg gac aac gtc aag acc aaa gac ctg 
tyr val asp asn val lys thr lys asp leu 
1411/471 

gcg gec ggg cac gtc acg ttc gtg acc gtt 
ala ala gly his val thr phe val thr val 
1471/491 

gag ccc ccg cgt acc tec gac atg aag gcg 
glu pro pro arg thr ser asp met lys ala 
1531/511 

ctg ccc ctg gaa aac gat cac aac gee cag 
leu pro leu glu asn asp his asn ala gin 
1591/531 

acc acc acc aag aag gcg ccg cag gcg ggt 
thr thr thr lys lys ala pro gin ala gly 
1651/551 

acg acg acc teg cca aaa gag gtc aca gtg 
thr thr thr ser pro lys glu val thr val 
1711/571 

ttg gee acc acc gee acc gat cag etc aag 
leu ala thr thr ala thr asp gin leu lys 
1771/591 

gac tac ccg agt teg ctg ctg gee acc aca 
asp tyr pro ser ser leu leu ala thr thr 
1831/611 

gee gee acc gtg gec gee gtg ttc ggc cag 
ala ala thr val ala ala val phe gly gin 
1891/631 

caa ctg gtc cag gtg gtg ctg ggc caa gac 
gin leu val gin val val leu gly gin asp 



SEQ ID NOS: 569-570 (continued 1) 



(continued 1) 



1921/641 1951/651 

ttc age gcg gtg cgc get ccc ctg ccg agt ggc tec acc gtc age gtg cag ata age cgc 
phe ser ala val arg ala pro leu pro ser gly ser thr val ser val gin ile ser arg 
1981/661 2011/671 

aac tec tec age cca ccg acc aag ctg ccc gag gac ctg acg gtc acc aac gee gec gac 
asn ser ser ser pro pro thr lys leu pro glu asp leu thr val thr asn ala ala asp 
2041/681 

acc acc tgc gag tag 
thr thr cys glu) AMB 



SEQ ID NOS: 569-570 (continued 2) 

FIG. 32D (continued 2 ) 



ORF according to Cole et al . (Nature 39: 
1/1 SEQ ID NO: 571 

tag gac atg agt gac ggc gag age gee gcg 
AMB (asp met ser asp gly glu ser ala ala 
61/2T^SEQ ID NOl 572 

ccc gat ggt gtt gac cga tgg ate acg gta 
pro asp gly val asp arg trp ile thr val 
121/41 

ccg egg gac acc cag aat gtc ggc tgt cat 
pro arg asp thr gin asn val gly cys his 
181/61 

ate gee agg etc ggc ccc get ttt cct gac 
ile ala arg leu gly pro ala phe pro asp 
241/81 

ccc gag cca tec ggc cgc ggc ccg aag gtc 
pro glu pro ser gly arg gly pro lys val 
301/101 

gag get ate gee ate. ccg gee cac teg etc 
glu ala ile ala ile pro ala his ser leu 
361/121 

gca gec aac tat ccg cgc gee gac cac gee 
ala ala asn tyr pro arg ala asp his ala 
421/141 

eta acc gga teg get cga gtg egg cca ttg 
leu thr gly ser ala arg val arg pro leu 
481/161 

aag cca gcg ccg aac tec ggc egg cgc ccg 
lys pro ala pro asn ser gly arg arg pro 
541/181 

ctg ttt gee get ctg gcg ttg gcg ctg acc 
leu phe ala ala leu ala leu ala leu thr 
601/201 

aac age egg ctg aac atg gta age gcg etc 
asn ser arg leu asn met val ser ala leu 
661/221 

age ggg cag cat ggc gac gag aac ttc ttg 
ser gly gin his gly asp glu asn phe leu 
721/241 

aac gee aat ate ggc gee ggc gac gee gag 
asn ala asn ile gly ala gly asp ala glu 
781/261 

atg ctg gtc aac att ccg gee age cgc gag 
met leu val asn ile pro ala ser arg glu 



SEQ 



1:537-544) and containing Rv0822c 
31/11 

ccg tgg gca egg etc tec gag tea gca ttc 

pro trp ala arg leu ser glu ser ala phe 
91/31 

ccg ccc gee aca tgg gtg gca gec cag ggt 

pro pro ala thr trp val ala ala gin gly 
151/51 

gec acc ggc gee gtt agt gtg gee gat ctg 

ala thr gly ala val ser val ala asp leu 
211/71 

etc ccc acg cac cgc cat gtc gee ccc gaa 

leu pro thr his arg his val ala pro glu 
271/91 

cac gac gac gee gac gac cag cag gac acc 

his asp asp ala asp asp gin gin asp thr 
331/111 

gag ttc etc teg gag ctt ccc gac etc egg 

glu phe leu ser glu leu pro asp leu arg 
391/131 

cgc cgt gaa ccc gag eta ccc ggc aag cag 

arg arg glu pro glu leu pro gly lys gin 
451/151 

egg ate cgc cga acg teg ccc gcg ccc gee 

arg ile arg arg thr ser pro ala pro ala 
511/171 

atg gtg ctg gee gcg cgc teg ctg gcg get 

met val leu ala ala arg ser leu ala ala 
571/191 

ggc ggg gca tgg cag tgg age gcg teg aag 

gly gly ala trp gin trp ser ala ser lys 
631/211 

gac ccg cat teg ggc gac ate gtc aac ccc 

asp pro his ser gly asp ile val asn pro 
691/231 

etc gtc ggt atg gac tct cgt gee ggg gcg 

leu val gly met asp ser arg ala gly ala 
751/251 

gac gee ggc ggc gca cgt teg gac acc gtc 

asp ala gly gly ala arg ser asp thr val 
811/271 

egg gtc gtc gcg gtg teg ttc ccc cgc gac 

arg val val ala val ser phe pro arg asp 



NOS: 571-572 



FIG. 32E 




841/281 

ctg gcg ate act cca ate caa tgc gag gcg 
leu ala ile thr pro ile gin cys glu ala 
901/301 

ate tac gac gag aag acg gga acg atg ggt 
ile tyr asp glu lys thr gly thr met gly 
961/321 

aac teg gca ttc tec ttc ggc ggg cct aag 
asn ser ala phe ser phe gly gly pro lys 
1021/341 

ggc ttg age ate aac egg ttc ate gcg att 
gly leu ser ile asn arg phe ile ala ile 
1081/361 

gee etc ggc ggc gtc gag gta tgc age ace 
ala leu gly gly val glu val cys ser thr 
1141/381 

gtg ctg gag cac gee gga cgc cag gtc att 
val leu glu his ala gly arg gin val ile 
1201/401 

get cgc cag gtc ace acc gag age aat ggc 
ala arg gin val thr thr glu ser asn gly 
1261/421 

ttt ttg teg teg ctg ctg cgt teg atg ate 
phe leu ser ser leu leu arg ser met ile 
1321/441 

etc aac aac gtc gtc aac atg ttc ate ggt 
leu asn asn val val asn met phe ile gly 
1381/461 

gac ctg gtc gaa etc ggt cga teg ttg cag 
asp leu val glu leu gly arg ser leu gin 
1441/481 

acc gtt ccg acc ggt ata acc gac cag aac 
thr val pro thr gly ile thr asp gin asn 
1501/501 

aag gcg ctt ttc acc gee ate ate gac gac 
lys ala leu phe thr ala ile ile asp asp 
1561/521 

gec cag cgt ctg ggc aac acg ccg teg acc 
ala gin arg leu gly asn thr pro ser thr 
1621/541 

gcg ggt ctg acc aac gag att cag cac cag 
ala gly leu thr asn glu ile gin his gin 
1681/561 

aca gtg cag gtc tct aac teg acc ggc cag 
thr val gin val ser asn ser thr gly gin 
1741/581 

etc aag egg aac ggc ttc aac gtg atg get 
leu lys arg asn gly phe asn val met ala 
1801/601 

acc aca gtg ttt ttt teg ccc ggc aac gaa 
thr thr val phe phe ser pro gly asn glu 
1861/621 

ggc cag tea aag ate gag egg gtg acc ggg 
gly gin ser lys ile glu arg val thr gly 
1921/641 

caa gac ttc age gcg gtg cgc get ccc ctg 
gin asp phe ser ala val arg ala pro leu 
1981/661 

age cgc aac tec tec age cca ccg acc aag 
ser arg asn ser ser ser pro pro thr lys 
2041/681 

gee gac acc acc tgc gag tag 
ala asp thr thr cys glu) AMB 



871/291 

tgg aac ccc gag acc ggt aag tac gga ccc 
trp asn pro glu thr gly lys tyr gly pro 
931/311 

ccc aga ctg gtg tac acg gag acc aag ctg 
pro arg leu val tyr thr glu thr lys leu 
991/331 

tgt eta gtg aag gtc att cag aaa ctg teg 
cys leu val lys val ile gin lys leu ser 
1051/351 

gac ttc gtc ggt ttc gcg egg atg gtc gag 
asp phe val gly phe ala arg met val glu 
1111/371 

acc ccg ttg egg gac tac gaa ctg ggc acg 
thr pro leu arg asp tyr glu leu gly thr 
1171/391 

gac ggg ccg acc gcg ctg aac tat gtg cgc 
asp gly pro thr ala leu asn tyr val arg 
1231/411 

gac tac ggg cgc ate aaa cgc cag cag ttg 
asp tyr gly arg ile lys arg gin gin leu 
1291/431 

teg acg gac acc ttg ttc aac etc age agg 
ser thr asp thr leu phe asn leu ser arg 
1351/451 

aac age tac gtg gac aac gtc aag acc aaa 
asn ser tyr val asp asn val lys thr lys 
1411/471 

cat atg gcg gee ggg cac gtc acg ttc gtg 
his met ala ala gly his val thr phe val 
1471/491 

ggc gac gag ccc ccg cgt acc tec gac atg 
gly asp glu pro pro arg thr ser asp met 
1531/511 

gat ccg ctg ccc ctg gaa aac gat cac aac 
asp pro leu pro leu glu asn asp his asn 
1591/531 

ccg ccg acc acc acc aag aag gcg ccg cag 
pro pro thr thr thr lys lys ala pro gin 
1651/551 

cag gtt acg acg acc teg cca aaa gag gtc 
gin val thr thr thr ser pro lys glu val 
1711/571 

gec ggt ttg gee acc acc gee acc gat cag 
ala gly leu ala thr thr ala thr asp gin 
1771/591 

ccg gac gac tac ccg agt teg ctg ctg gee 
pro asp asp tyr pro ser ser leu leu ala 
1831/611 

cag get gec gec acc gtg gee gee gtg ttc 
gin ala ala ala thr val ala ala val phe 
1891/631 

ate ggc caa ctg gtc cag gtg gtg ctg ggc 
ile gly gin leu val gin val val leu gly 
1951/651 

ccg agt ggc tec acc gtc age gtg cag ata 
pro ser gly ser thr val ser val gin ile 
2011/671 

ctg ccc gag gac ctg acg gtc acc aac gee 
leu pro glu asp leu thr val thr asn ala 



SEQ ID NOS: 571-572 (continued 1) 



FIG. 32E (continued 1) 




1/l^SEQ ID 


NO: 


573 












31/11 
















CGT CAC 


CTC 


TGC 


CAT 


GGT 


CCA 


TCT 


ACG 


GTA 


TCT GCG 


ACA AGG 


GCA 


GCG 


TCG 


ATC 


CCT 


CGA 


(arg his 


leu 


cys 


his 


gly 


pro 


ser 


thr 


val 


ser ala 


thr arg 


ala 


ala 


ser 


ile 


pro 


arg 


61/21^- 


SEQ 


ID NO: 574 










9i/31 
















CAT GCA 


GAG 


TCG 


GTG 


TTC 


GCT 


TCA 


CGC 


GAA 


CTA GGC 


GCG CCT 


AGC 


CTG 


GAC 


GAG 


TCC 


CCG 


his ala 


glu 


ser 


val 


phe 


ala 


ser 


arg 


glu 


leu gly 


ala pro 


ser 


leu 


asp 


glu 


ser 


pro 


121/41 


















151/51 
















GGC CGA 


CAT 


TCG 


CCC 


GAG 


GCC 


TTG 


GCC 


TCC 


ATC ACC 


TAA TTG 


TGT 


GCA 


AAA 


CCG 


TAT 


CTA 


gly arg 


his 


ser 


pro 


glu 


ala 


leu 


ala 


ser 


ile thr)OCH(leu 


cys 


ala 


lys 


pro 


tyr 


leu 


181/61 


















211/71 


* SEQ ID NO: 575 






ATT GAT 


ACG 


ATT 


GCG 


CAC 


ATG 


GCT 


ATC 


TGG 


GAT C 
















ile asp 


thr 


ile 


ala 


his 


met 


ala 


ile 


trp 


asp) 

















SEQ ID NOS: 573-575 



FIG. 33A 

1/1^-SEQ ID NO: 576 31/11 



GTC ACC 


TCT 


GCC 


ATG 


GTC 


CAT 


CTA 


CGG 


TAT 


CTG CGA 


CAA GGG 


CAG 


CGT 


CGA 


TCC 


CTC 


GAC 


(val thr 


ser 


ala 


met 


val 


his 


leu 


arg 


tyr 


leu arg 


gin gly 


gin 


arg 


arg 


ser 


leu 


asp 


61/21^SEQ ID NO: 577 










91/31 
















ATG CAG 


AGT 


CGG 


TGT 


TCG 


CTT 


CAC 


GCG 


AAC 


TAG GCG 


CGC CTA 


GCC 


TGG 


ACG 


AGT 


CCC 


CGG 


met gin 


ser 


arg 


cys 


ser 


leu 


his 


ala 


asn) AMB (ala 


arg leu 


ala 


trp 


thr 


ser 


pro 


arg 


121/41 


















151/51^ 


^SEQ ID 


NO: 


578 










GCC GAC 


ATT 


CGC 


CCG 


AGG 


CCT 


TGG 


CCT 


CCA 


TCA CCT 


AAT TGT 


GTG 


CAA 


AAC 


CGT 


ATC 


TAA 


ala asp 


ile 


arg 


pro 


arg 


pro 


trp 


pro 


pro 


ser pro 


asn cys 


val 


gin 


asn 


arg 


ile)OCH 


181/61 


















211/71 
















TTG ATA 


CGA 


TTG 


CGC 


ACA 


TGG 


CTA 


TCT 


GGG 


ATC 

















(leu ile arg leu arg thr trp leu ser gly ile) 
^v_SEQ ID NO: 579 

SEQ ID NOS: 576-579 

FIG. 33B 



1/1 -^-SEQ ID NO: 580 

CCG TCA CCT CTG CCA TGG TCC ATC TAC GGT 
( pro ser pro leu pro trp ser ile tyr gly 
61/21 ^ — SEQ ID NO: 581 

ACA TGC AGA GTC GGT GTT CGC TTC ACG CGA 
thr cys arg val gly val arg phe thr arg 
121/41 

GGG CCG ACA TTC GCC CGA GGC CTT GGC CTC 
gly pro thr phe ala arg gly leu gly leu 
181/61 

AAT TGA TAC GAT TGC GCA CAT GGC TAT CTG 
asn)OPA(tyr asp cys ala his gly tyr leu 
SEQ ID NO: 583 

SEQ ID NC 



31/11 

ATC TGC GAC AAG GGC AGC GTC GAT CCC TCG 
ile cys asp lys gly ser val asp pro ser 
91/31 

ACT AGG CGC GCC TAG CCT GGA CGA GTC CCC 
thr arg arg ala) AMB (pro gly arg val pro 
151/51 ^— -SEQ ID NO: 582 
CAT CAC CTA ATT GTG TGC AAA ACC GTA TCT 
his his leu ile val cys lys thr val ser 
211/71 
GGA TC 

gly) 



:580-583 



FIG. 33C 



sequence Rvl044 predicted by Cole et al . (Nature 393:537-544) and 
containing seq33A 



1/l^SEQ ID NO: 584 

ttg tgt gca aaa ccg tat eta att gat acg 
( leu c ys ala lys pro tyr leu ile asp thr 
61/2 f*— SEQ ID NO: 585 

gtc gag gtt gec gec gag caa cat ggc tac 
val glu val ala ala glu gin his gly tyr 
121/41 

gtc gac cct gtg cag etc cgc etc eta gcg 
val asp pro val gin leu arg leu leu ala 
181/61 

ggt gtg tac egg gtg ccc gtg ctg ccg cgt 
gly val tyr arg val pro val leu pro arg 
241/81 

teg tgg act ttg ggg cgt ggc gtt ate teg 
ser trp thr leu gly arg gly val ile ser 
301/101 

get gac gtg aac ccg teg cgc ate cat etc 
ala asp val asn pro ser arg ile his leu 
361/121 

gee ggg ggc gag ctg tac cga gtt cac cgc 
ala gly gly glu leu tyr arg val his arg 
421/141 

gtc gac gga ata ccc gtc acg acg gtt gcg 
val asp gly ile pro val thr thr val ala 
481/161 

acg gat cct tat cag ctt egg gee gcg ate 
thr asp pro tyr gin leu arg ala ala ile 
541/181 

cgt ggg tea gca get gag eta cgc get gcg 
arg gly ser ala ala glu leu arg ala ala 
601/201 

egg ccg aag cga gca teg gcg tga 
arg pro lys arg ala ser ala)OPA 



31/11 

att gcg cac atg get ate tgg gat cgc etc 
ile ala his met ala ile trp asp arg leu 
91/31 

gtc acg act cgc gat gcg cga gac ate ggc 
val thr thr arg asp ala arg asp ile gly 
151/51 

ggg cgc gga cgt ctt gag cgt gtc ggc cga 
gly arg gly arg leu glu arg val gly arg 
211/71 

ggt gag cac gac gat etc gca gee gca gtg 
gly glu his asp asp leu ala ala ala val 
271/91 

cat gag teg gec ttg gcg ctt cat gee etc 
his glu ser ala leu ala leu his ala leu 
331/111 

ace gtc ccg cgc aac aac cat ccg cgt gcg 
thr val pro arg asn asn his pro arg ala 
391/131 

cgc gac etc cag gca gee cac gtc act teg 
arg asp leu gin ala ala his val thr ser 
451/151 

cgc ace ate aaa gac tgc gtg aag acg ggc 
arg thr ile lys asp cys val lys thr gly 
511/171 

gag cga gee gaa gee gag ggc acg ctt cgt 
glu arg ala glu ala glu gly thr leu arg 
571/191 

etc gat gag acc act gee gga tta cgc get 
leu asp glu thr thr ala gly leu arg ala 



SEQ ID NOS: 584-585 



FIG. 33D 




ORF according to Cole et al . (Nature 393:537-544) and containing Rvl044 



1/1^3KQ ID 


NO: 


586 












n /1 1 

J X / XX 


















taa ttg 


tat 


gca 


aaa 




f af- 
L.CI L. 


C t-Gl 




gat 


acg att 


gcg 


cac 


atg 


get 


ate 


tgg 


gat 


cge 


OCH { 


jf & 


ala 


lys 


pro 


tvr 


leu 


ile 


asp 


thr ile 


did 


his 


met 


axa 


ile 


trp 


asp 


arg 


61 /21^ % ^ 




ID NO: ! 


587 














91/31 






etc gtc 


aacr 
y «y 




gee 


gec 


gag 


caa 


CaC 


ggc 


tac gtc 


acg 


act 


cge 


gat 


gcg 


cga 


gac 


ate 


leu val 


glu 


val 


axa 


ala 


glu 


gin 


his 


yxy 


tyr val 




thr 


arg 


asp 


ala 


arg 


asp 


ile 


121/41 


















131/ J X 












crcrc crte 

y y v- y v-*— 


gac 




gtg 


cag 




cge 


etc 


C Ca 


gcg ggg 


cge 


gga 


cgt 


ctt 


gag 


cgt 


gtc 


ggc 


y x y vax 


asp 


pro 


vai 


gin 


leu 


arg 


1 eu 


1 eu 


ala gly 


arg 


gly 


arg 


leu 


glu 


arg 


val 


gly 


1 R1 /fi1 


















Z X± / / X 














caa ocrt 


cr to 
y *-y 


CSC 


c gg 




ccc 


gtg 


c tg 


ccg 


cgt ggt 


gag 


cac 


gac 


gat 


etc 


gca 


gee 


gca 


s T*rr rrl v 

ci^y y j_ 


val 


tyr 


arg 


val 


pro 


val 


*i pi i 


LyX W 


arg gly 


glu 


his 


asp 


asp 


leu 


ala 


ala 


ala 


?41 /R1 

^ X / OX 


















1 X / Z? X 












crtcr tccr 

y *-y y 


t crcr 


d \— l_ 


teg 


ggg 


cgt 


ggc 


gtc 


acc 


teg cat 


gag 


teg 


gee 


ttg 


gcg 


ctt 


cat 


gee 


val ser 


t rp 


enr 


leu 


crlv 


arg 


crlv 


va 1 


ile 


OCX IlXo 


glu 


ser 


ala 


xeu 


ala 


leu 


his 


ala 


-J VJ _L / XUX 


















J J x / X x x 
















etc get 


gac 


gtg 


aac 


<~*y 


teg 


cge 


4- 
aCC 


Cat 


etc acc 


gtc 


ccg 


cge 


aac 


aac 


cat 


ccg 


cgt 


leu ala 




val 


asn 


pro 


ser 


wx y 


ile 


his 


leu thr 


val 


pro 


arg 


asn 


asn 


nis 


pro 


arg 


JOi / X*2 X 


















■JQ1 / 1 T 1 
J J X / 1J1 














y y ycc 


9* 9" 9* 


ggc 


gag 


c tg 


tac 


cga 


gtt 


cac 


cge cge 


gac 


etc 


cag 


gca 


gec 


cac 


gtc 


act 


ala ala 


gly 


gly 


glu 


1 €U 


tyr 


arg 


Val 


his 


arg arg 


asp 


leu 


gin 


ala 


ala 


his 


val 


thr 


421/141 


















451/151 


















teg gtc 


gac 


gga 


ata 


ccc 


gtc 


acg 


acg 


gtt 


gcg cge 


acc 


ate 


aaa 


gac 


tgc 


gtg 


aag 


acg 


ser val 


asp 


gly 


ile 


pro 


val 


thr 


thr 


val 


ala arg 


thr 


ile 


lys 


asp 


cys 


val 


lys 


thr 


481/161 


















511/171 














ggc acg 


gat 


cct 


tat 


cag 


ctt 


egg 


gee 


gcg 


ate gag 


cga 


gee 


gaa 


gee 


gag 


ggc 


acg 


ctt 


gly thr 


asp 


pro 


tyr 


gin 


leu 


arg 


ala 


ala 


ile glu 


arg 


ala 


glu 


ala 


glu 


gly 


thr 


leu 


541/181 


















571/191 














cgt cgt 


ggg 


tea 


gca 


get 


gag 


eta 


cge 


get 


gcg etc 


gat 


gag 


acc 


act 


gec 


gga 


tta 


cge 


arg arg 


gly 


ser 


ala 


ala 


glu 


leu 


arg 


ala 


ala leu 


asp 


glu 


thr 


thr 


ala 


gly 


leu 


arg 


601/201 
































get egg 


ccg 


aag 


cga 


gca 


teg 


gcg 


tga 






















ala arg 


pro 


lys 


arg 


ala 


ser 


ala)OPA 























SEQ ID NOS: 586-587 

FIG. 33E 



1/1 ^ — SEQ ID NO: 588 

ATC CAA CCT GCT GGG CCT GCG CCT TCG AAT 
( ile gin pro ala gly pro ala pro ser asn 
61/2"f^- SEQ ID NO: 589 

AC A AC A CCT GGA ATG GGG ACC TTT TCG GTG 
thr thr pro gly met gly thr phe ser val 
121/41 

CCT CGG TCG AGA CGT ATC GCG GCA GCG TTG 
pro arg ser arg arg ile ala ala ala leu 
181/61 

CGC CGC ATA TTT GCC GCG CTG CCG CGG GCC 
arg arg ile phe ala ala leu pro arg ala 



31/11 

CGA CGG CCA GGC CAC CGC TCG CTG CCG GCA 
arg arg pro gly his arg ser leu pro ala 
91/31 

TTG CTG GTA ACC GGG ACA ACC GGC ACC ACG 
leu leu val thr gly thr thr gly thr thr 
151/51 

GCC CTG TCG TTG CTG ACA ATT ACC GCT GGC 
ala leu ser leu leu thr ile thr ala gly 
211/71 
GGA TC 

giy) 



SEQ ID NOS: 588-589 



FIG. 34A 




1/1 ^SEQ ID NO: 


590 










TCC AAC 


CTG 


CTG 


GGC 


CTG 


CGC 


CTT 


CGA 


ATC 


(ser asn 


leu 


leu 


gly 


leu 


arg 


leu 


arg 


ile 


61/21^- 


SEQ 


ID NO: 591 










CAA CAC 


CTG 


GAA 


TGG 


GGA 


CCT 


TTT 


CGG 


TGT 


gin his 


leu 


glu 


trp 


gly 


pro 


phe 


arg 


cys 


121/41 


















CTC GGT 


CGA 


GAC 


GTA 


TCG 


CGG 


CAG 


CGT 


TGG 


leu gly 


arg 


asp 


val 


ser 


arg 


gin 


arg 


trp 


181/61 


















GCC GCA 


TAT 


TTG 


CCG 


CGC 


TGC 


CGC 


GGG 


CCG 


ala ala 


tyr 


leu 


pro 


arg 


cys 


arg 


gly 


pro 



31/11 

GAC GGC CAG GCC 
asp gly gin ala 
91/31 

TGC TGG TAA CCG 
cys trp)OCH(pro 

151/51 < - 

CCC TGT CGT TGC 

pro cys arg cys ) 

211/71 

GAT C 

asp) 



ACC GCT CGC TGC CGG CAA 
thr ala arg cys arg gin 

GGA CAA CCG GCA CCA CGC 
gly gin pro ala pro arg 
SEQ ID NO: 592 
TGA CAA TTA CCG CTG GCC 
OPA(gln leu pro leu ala 
SEQ ID NO: 593 



SEQ ID NOS: 590-593 

FIG. 34B 



1/1 SEQ ID NO: 594 31/11 

GAT CCA ACC TGC TGG GCC TGC GCC TTC GAA TCG ACG GCC AGG CCA CCG CTC GCT GCC GGC 
( asp pr o thr cys trp ala cys ala phe glu ser thr ala arg pro pro leu ala ala gly 
61/21 SEQ ID NO: 595 91/31 

AAC AAC ACC TGG AAT GGG GAC CTT TTC GGT GTT GCT GGT AAC CGG GAC AAC CGG CAC CAC 
asn asn thr trp asn gly asp leu phe gly val ala gly asn arg asp asn arg his his 
121/41 151/51 

GCC TCG GTC GAG ACG TAT CGC GGC AGC GTT GGC CCT GTC GTT GCT GAC AAT TAC CGC TGG 
ala ser val glu thr tyr arg gly ser val gly pro val val ala asp asn tyr arg trp 
181/61 211/71 
CCG CCG CAT ATT TGC CGC GCT GCC GCG GGC CGG ATC 
pro pro his ile cys arg ala ala ala gly arg ile) 



SEQ ID NOS: 594-595 



FIG. 34C 



ORF according to Cole et al . (Nature 39: 
1/ 1 s SEQ ID NO: 596 

tag ccg cag ggc cct gcg get agg cgc ggc 
AMB(pro gin gly pro ala ala arg arg gly 
61/2T^~SEQ ID NO: 597 

ttg cag cag tta caa cgc caa atg gag tct 
leu gin gin leu gin arg gin met glu ser 
121/41 

agg gga gac gtt gcg cag cga egg ate caa 
arg gly asp val ala gin arg arg ile gin 
181/61 

cca ggc cac cgc teg ctg ccg gca aca aca 
pro gly his arg ser leu pro ala thr thr 
241/81 

gta acc ggg aca acc ggc acc acg cct egg 
val thr gly thr thr gly thr thr pro arg 
301/101 

teg ttg ctg aca att acc get ggc cgc cgc 
ser leu leu thr ile thr ala gly arg arg 
361/121 

agg teg acc tgc cag ate tea ccg cgc age 
arg ser thr cys gin ile ser pro arg ser 
421/141 

gcg acg gca ggc cca etc tct tgg cat gcg 
ala thr ala gly pro leu ser trp his ala 
481/161 

etc acg ctt ggc ttc atg ccg cag tec tac 
leu thr leu gly phe met pro gin ser tyr 



: 537-544) containing seq34A 



31/11 



egg tgc 


cgt 


tgg 


ccg 


egg 


egg 


caa 


teg 


atg 


arg cys 


arg 


trp 


pro 


arg 


arg 


gin 


ser 


met 






91/31 












gag cgc 


ate 


gtc 


gag 


ttc 


gat 


cag 


etc 


ggc 


glu arg 


i le 


val 


glu 


phe 


asp 


gin 


leu 


gly 


151/51 


















cct get 


ggg 


cct 


gcg 


cct 


teg 


aat 


cga 


egg 


pro ala 


gly 


pro 


ala 


pro 


ser 


asn 


arg 


arg 


211/71 


















cct gga 


atg 


ggg 


acc 


ttt 


teg 


gtg 


ttg 


ctg 


pro gly 


met 


gly 


thr 


phe 


ser 


val 


leu 


leu 


271/91 


















teg aga 


cgt 


ate 


gcg 


gca 


gcg 


ttg 


gee 


ctg 


ser arg 


arg 


ile 


ala 


ala 


ala 


leu 


ala 


leu 


331/111 


















ata ttt 


gee 


gcg 


ctg 


ccg 


egg 


gee 


gga 


tec 


ile phe 


ala 


ala 


leu 


pro 


arg 


ala 


gly 


ser 


3.91/131 


















ate tac 


gee 


gtt 


cgc 


tgc 


aaa 


ccg 


ccg 


act 


ile tyr 


ala 


val 


arg 


cys 


lys 


pro 


pro 


thr 


451/151 


















tec aat 


get 


gcg 


acg 


tec 


teg 


gta 


gac 


aag 


ser asn 


ala 


ala 


thr 


ser 


ser 


val 


asp 


lys 


511/171 


















cca tgt 


agt 


aac 


aga 


tag 










pro cys 


ser 


asn 


arg)AMB 











SEQ ID NOS: 596-597 

FIG. 34E 



1/1^-SEQ ID 


NO: 


598 












31/11 


















CAG TCT 


GTC 


GGC 


AAG 


GAG 


GGA 


CGC 


ATG 


CCA 


CTC TCC 


GAT 


CAT 


GAG 


CAG 


CGG 


ATG 


CTT 


GAC 


(gin ser 


val 


gly 


lys 


glu 


gly 


arg 


met 


pro 


leu ser 


asp 


his 


glu 


gin 


arg 


met 


leu 


asp 


(TT/21 ^ 


— SEQ ID 


NO: 


599 










91/31 


















CAG ATC 


GAG 


AGC 


GCT 


CTC 


TAC 


GCC 


GAA 


GAT 


CCC AAG 


TTC 


GCA 


TCG 


AGT 


GTC 


CGT 


GGC 


GGG 


gin ile 


glu 


ser 


ala 


leu 


tyr 


ala 


glu 


asp 


pro lys 


phe 


ala 


ser 


ser 


val 


arg 


gly 


gly 


121/41 


















151/51 


















GGC TTC 


CGC 


GCA 


CCG 


ACC 


GCG 


CGG 


CGG 


CGC 


CTG CAG 


GGC 


GCG 


GCG 


TTG 


TTC 


ATC 


ATC 


GGT 


gly phe 


arg 


ala 


pro 


thr 


ala 


arg 


arg 


arg 


leu gin 


gly 


ala 


ala 


leu 


phe 


ile 


ile 


gly 


181/61 


















211/71 


















CTG GGG 


ATG 


TTG 


GTT 


TCC 


GGC 


GTG 


GCG 


TTC 


AAA GAG 


ACC 


ATG 


ATC 


GGA 


AGT 


TTC 


CCG 


ATA 


leu gly 


met 


leu 


val 


ser 


gly 


val 


ala 


phe 


lys glu 


thr 


met 


ile 


gly 


ser 


phe 


pro 


ile 


241/81 


















271/91 


















CTC AGC 


GTT 


TTC 


GGT 


TTT 


GTC 


GTG 


ATG 


TTC 


GGT GGT 


GTG 


GTG 


TAT 


GCC 


ATC 


ACC 


GGT 


CCT 


leu ser 


val 


phe 


gly 


phe 


val 


val 


met 


phe 


gly gly 


val 


val 


tyr 


ala 


ile 


thr 


gly 


pro 


301/101 


















331/111 


















CGG TTG 


TCC 


GGC 


AGG 


ATG 


GAT 


CGT 


GGC 


GGA 


TCG GCT 


GCT 


GGG 


GCT 


TCG 


CGC 


CAG 


CGT 


CGT 


arg leu 


ser 


gly 


arg 


met 


asp 


arg 


gly 


gly 


ser ala 


ala 


gly 


ala 


ser 


arg 


gin 


arg 


arg 


361/121 


















391/131 


















ACC AAG 


GGG 


GCC 


GGG 


GGC 


TCA 


TTC 


ACC 


AGC 


CGT ATG 


GAA 


GAT 


C 












thr lys 


gly 


ala 


gly 


gly 


ser 


phe 


thr 


ser 


arg met 


glu 


asp) 















SEQ ID NOS: 598-599 



FIG. 35A 




1/1 _^-SEQ ID NO: 600 










31/11 


















GAC AGT 


CTG 


TCG 


GCA 


AGG 


AGG 


GAC 


GCA 


TGC 


CAC TCT 


CCG 


ATC 


ATG 


AGC 


AGC 


GGA 


TGC 


TTG 


(asp ser 


leu 


ser 


ala 


arg 


arg 


asp 


ala 


cys 


his ser 


pro 


ile 


met 


ser 


ser 


gly 


cys 


leu 


61/21 ' SEQ ID 


NO: 


601 














91/31 












ACC AGA 


TCG 


AGA 


GCG 


CTC 


TCT 


ACG 


CCG 


AAG 


ATC CCA 


AGT 


TCG 


CAT 


CGA 


GTG 


TCC 


GTG 


GCG 


thr arg 


ser 


arg 


ala 


leu 


ser 


thr 


pro 


lys 


ile pro 


ser 


ser 


his 


arg 


val 


ser 


val 


ala 


121/41 


















151/51 


















GGG GCT 


TCC 


GCG 


CAC 


CGA 


CCG 


CGC 


GGC 


GGC 


GCC TGC 


AGG 


GCG 


CGG 


CGT 


TGT 


TCA 


TCA 


TCG 


gly ala 


ser 


ala 


his 


arg 


pro 


arg 


gly 


gly 


ala cys 


arg 


ala 


arg 


arg 


cys 


ser 


ser 


ser 


181/61 


















211/71 


















GTC TGG 


GGA 


TGT 


TGG 


TTT 


CCG 


GCG 


TGG 


CGT 


TCA AAG 


AGA 


CCA 


TGA 


TCG 


GAA 


GTT 


TCC 


CGA 


val trp 


gly 


cys 


trp 


phe 


pro 


ala 


trp 


arg 


ser lys 


arg 


pro)OPA(ser 


glu 


val 


ser 


arg 


241/81 


















271/91 








SEQ ID NO: 602 


TAC TCA 


GCG 


TTT 


TCG 


GTT 


TTG 


TCG 


TGA 


TGT 


TCG GTG 


GTG 


TGG 


TGT 


ATG 


CCA 


TCA 


CCG 


GTC 


tyr ser 


ala 


phe 


ser 


val 


leu 


ser ) OPA(cys 


ser val 


val 


trp 


cys 


met 


pro 


ser 


pro 


val 


301/101 






SEQ ID NO: 603-^ 




331/111 


















CTC GGT 


TGT 


CCG 


GCA 


GGA 


TGG 


ATC 


GTG 


GCG 


GAT CGG 


CTG 


CTG 


GGG 


CTT 


CGC 


GCC 


AGC 


GTC 


leu gly 


cys 


pro 


ala 


gly 


trp 


Lie 


val 


ala 


asp arg 


leu 


leu 


gly 


leu 


arg 


ala 


ser 


val 


361/121 


















391/131 


















GTA CCA 


AGG 


GGG 


CCG 


GGG 


GCT 


CAT 


TCA 


CCA 


GCC GTA 


TGG 


AAG 


ATC 












val pro 


arg 


gly 


pro 


gly 


ala 


his 


ser 


pro 


ala val 


trp 


lys 


ile) 











SEQ XD NOS: 600-603 

FIG 35B 



1/1 _^SEQ ID NO: 604 










31/11 


















ACA GTC 


TGT 


CGG 


CAA 


GGA 


GGG 


ACG 


CAT 


GCC 


ACT CTC 


CGA 


TCA 


TGA 


GCA 


GCG 


GAT 


GCT 


TGA 


(thr val 


cys 


arg 


gin 


gly 


gly 


thr 


his 


ala 


thr leu 


arg 


ser)OPA(ala 


ala 


asp 


ala)OPA 


61/21 ' 


— SEQ ID NO: 


605 








91/31 








SEQ ID NO: 


606 


CCA GAT 


CGA 


GAG 


CGC 


TCT 


CTA 


CGC 


CGA 


AGA 


TCC CAA 


GTT 


CGC 


ATC 


GAG 


TGT 


CCG 


TGG 


CGG 


(pro asp 


arg 


glu 


arg 


ser 


leu 


arg 


arg 


arg 


ser gin 


val 


arg 


ile 


glu 


cys 


pro 


trp 


arg 


121/41^ 


- — SEQ ID NO: 


607 








151/51 


















GGG CTT 


CCG 


CGC 


ACC 


GAC 


CGC 


GCG 


GCG 


GCG 


CCT GCA 


GGG 


CGC 


GGC 


GTT 


GTT 


CAT 


CAT 


CGG 


gly leu 


pro 


arg 


thr 


asp 


arg 


ala 


ala 


ala 


pro ala 


gly 


arg 


gly 


val 


val 


his 


his 


arg 


181/61 


















211/71 


















TCT GGG 


GAT 


GTT 


GGT 


TTC 


CGG 


CGT 


GGC 


GTT 


CAA AGA 


GAC 


CAT 


GAT 


CGG 


AAG 


TTT 


CCC 


GAT 


ser gly 


asp 


val 


gly 


phe 


arg 


arg 


gly 


val 


gin arg 


asp 


his 


asp 


arg 


lys 


phe 


pro 


asp 


241/81 


















271/91 


















ACT CAG 


CGT 


TTT 


CGG 


TTT 


TGT 


CGT 


GAT 


GTT 


CGG TGG 


TGT 


GGT 


J GTA 


TGC 


CAT 


CAC 


CGG 


TCC 


thr gin 


arg 


phe 


arg 


phe 


cys 


arg 


asp 


val 


arg trp 


cys 


gly 


val 


cys 


his 


his 


arg 


ser 


301/101 


















331/111 
















TCG 


TCG GTT 


GTC 


CGG 


CAG 


GAT 


GGA 


TCG 


TGG 


CGG 


ATC GGC 


TGC 


TGG 


GGC 


TTC 


GCG 


CCA 


GCG 


ser val 


val 


arg 


gin 


asp 


gly 


ser 


trp 


arg 


ile gly 


cys 


trp 


gly 


phe 


ala 


pro 


ala 


ser 


361/121 


















391/131 


















TAC CAA 


GGG 


GGC 


CGG 


GGG 


CTC 


ATT 


CAC 


CAG 


CCG TAT 


GGA 


AGA 


TC 












tyr gin 


gly 


gly 


arg 


gly 


leu 


ile 


his 


gin 


pro tyr 


gly 


arg) 













SEQ ID NOS: 604-607 



FIG. 35C 



sequence Rv2169c predicted by Cole et ai 
partially containing seq35A 

1/1 ^^SEQ ID NOs 608 

atg cca etc tec gat cat gag cag egg atg 
( Met p ro leu ser asp his glu gin arg met 
61/21 SEQ ID NO: 609 

gaa gat ccc aag ttc gca teg agt gtc cgt 
glu asp pro lys phe ala ser ser val arg 
121/41 

egg cgc ctg cag ggc gcg gcg ttg ttc ate 
arg arg leu gin gly ala ala leu phe ile 
181/61 

gcg ttc aaa gag ace atg ate gga agt ttc 
ala phe lys glu thr met ile gly ser phe 
241/81 

atg ttc ggt ggt gtg gtg tat gee ate acc 
met phe gly gly val val tyr ala ile thr 
301/101 

ggc gga teg get get ggg get teg cgc cag 
gly gly ser ala ala gly ala ser arg gin 
361/121 

acc age cgt atg gaa gat egg ttc egg cgc 
thr ser arg met glu asp arg phe arg arg 



(Nature 393:537-544) and 
31/11 

ctt gac cag ate gag age get etc tac gee 
leu asp gin ile glu ser ala leu tyr ala 
91/31 

ggc ggg ggc ttc cgc gca ccg acc gcg egg 
gly gly gly phe arg ala pro thr ala arg 
151/51 

ate ggt ctg ggg atg ttg gtt tec ggc gtg 
ile gly leu gly met leu val ser gly val 
211/71 

ccg ata etc age gtt ttc ggt ttt gtc gtg 
pro ile leu ser val phe gly phe val val 
271/91 

ggt cct egg ttg tec ggc agg atg gat cgt 
gly pro arg leu ser gly arg met asp arg 
331/111 

cgt cgt acc aag ggg gee ggg ggc tea ttc 
arg arg thr lys gly ala gly gly ser phe 
391/131 

cgc ttc gac gag taa 
arg phe asp glu)OCH 



SEQ XD NOS: 608-609 

FIG. 35D 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv2169c 



1/1^-SEQ ID NO: 


610 










31/11 


















tga cag 


tct 


gtc 


ggc 


aag 


gag 


gga 


cgc 


atg 


cca etc 


tec 


gat 


cat 


gag 


cag 


egg 


atg 


ctt 


OPA(gln 


ser 


val 


gly 


lys 


glu 


gly 


arg 


met 


pro leu 


ser 


asp 


his 


glu 


gin 


arg 


met 


leu 


61/21^- 


SEQ 


ID NO: 6X1 










91/31 


















gac cag 


ate 


gag 


age 


get 


etc 


tac 


gee 


gaa 


gat ccc 


aag 


ttc 


gca 


teg 


agt 


gtc 


cgt 


ggc 


asp gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala 


glu 


asp pro 


lys 


phe 


ala 


ser 


ser 


val 


arg 


gly 


121/41 


















151/51 


















ggg ggc 


ttc 


cgc 


gca 


ccg 


acc 


gcg 


egg 


egg 


cgc ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 


ate 


gly gly 


phe 


arg 


ala 


pro 


thr 


ala 


arg 


arg 


arg leu 


gin 


gly 


ala 


ala 


leu 


phe 


ile 


ile 


181/61 


















211/71 


















ggt ctg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


gtg 


gcg 


ttc aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ttc 


ccg 


gly leu 


gly 


met 


leu 


val 


ser 


gly 


val 


ala 


phe lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 


pro 


241/81 


















271/91 


















ata etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg 


atg 


ttc ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


acc 


ggt 


ile leu 


ser 


val 


phe 


gly 


phe 


val 


val 


met 


phe gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 


gly 


301/101 


















331/111 


















cct egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt 


ggc 


gga teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 


cgt 


pro arg 


leu 


ser 


gly 


arg 


met 


asp 


arg 


gly 


gly ser 


ala 


ala 


gly 


ala 


ser 


arg 


gin 


arg 


361/121 


















391/131 


















cgt acc 


aag 


ggg 


gec 


ggg 


ggc 


tea 


ttc 


acc 


age cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 


cgc 


arg thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe 


thr 


ser arg 


met 


glu 


asp 


arg 


phe 


arg 


arg 


arg 



421/141 

ttc gac gag taa 
phe asp glu)OCH 

SEQ XD NOS: 610-611 

FIG. 35E 



1/1 ^-SEQ ID NO: 612 










GAC CTG 


GGA 


CGA 


AGA 


CGA 


CGG 


CAG 


CAG 


CCG 


(asp leu 


gly 


arg 


arg 


arg 


arg 


gin 


gin 


pro 


61/21 


-SEQ ID 


NO: 


613 










CAA TGG 


ACA 


CCC 


GAC 


TAC 


GGT 


GCG 


CCT 


GCG 


gin trp 


thr 


pro 


asp 


tyr 


gly 


ala 


pro 


ala 


121/41 


















GTG GTC 


GGA 


GTG 


CCA 


CCC 


GAC 


CAG 


GCC 


ACC 


val val 


gly 


val 


pro 


pro 


asp 


gin 


ala 


thr 


181/61 


















ACG GCG 


CCG 


GTC 


TGG 


ATC 


ACC 


ATG 


CTG 


TGG 


thr ala 


pro 


val 


trp 


ile 


thr 


met 


leu 


trp 


241/81 


















GGG GCA 


CCC 


GGT 


GGC 


ACC 


GTT 


CCC 


GTC 


CGG 


gly ala 


pro 


gly 


gly 


thr 


val 


pro 


val 


arg 


301/101 


















GCC AAC 


GGC 


GGC 


CGG 


CTG 


GAC 


ATC 


CTC 


CTG 


ala asn 


gly 


gly 


arg 


leu 


asp 


ile 


leu 


leu 


361/121 


















GTC GAC 


CCC 


GAC 


GGC 


GCC 


GTC 


GGC 


CGA 


GCG 


val asp 


pro 


asp 


gly 


ala 


val 


gly 


arg 


ala 



31/11 

CAA TCA GAT CTA CCC GGT CCT GGT CAA CGT 
gin ser asp leu pro gly pro gly gin arg 
91/31 

CGG CTC GAC AAT GCG CGG TTC CTG TTG CCC 
arg leu asp asn ala arg phe leu leu pro 
151/51 

GAC TTC GGC TCC GCT GTT GCA CCA GAA ACG 
asp phe gly ser ala val ala pro glu thr 
211/71 

CCG CTG GCC GAC CGG CCC CGG TTG GCC CCC 
pro leu ala asp arg pro arg leu ala pro 
271/91 

CTG GTC GAC GAC GAC CTG GCA AAC TCG CTG 
leu val asp asp asp leu ala asn ser leu 
331/111 

TCG GCG GCC GAG TTC GCC ACC AAC CGG GAA 
ser ala ala glu phe ala thr asn arg glu 
391/131 

CTG TGC CTG GCC ATC GAC CCA GAT C 
leu cys leu ala ile asp pro asp) 



SEQ ID NOS: 612-613 

FIG. 36A 



1/1 — SEQ ID NO: 614 










31/11 


















ACC TGG 


GAC 


GAA 


GAC 


GAC 


GGC 


AGC 


AGC 


CGC 


AAT CAG 


ATC 


TAC 


CCG 


GTC 


CTG 


GTC 


AAC 


GTC 


(thr trp 


asp 


glu 


asp 


asp 


gly 


ser 


ser 


arg 


asn gin 


ile 


tyr 


pro 


val 


leu 


val 


asn 


val 


61/21 


-SEQ ID 


NO: 


615 










91/31 


















AAT. GGA 


CAC 


CCG 


ACT 


ACG 


GTG 


CGC 


CTG 


CGC 


GGC TCG 


ACA 


ATG 


CGC 


GGT 


TCC 


TGT 


TGC 


CCG 


asn gly 


his 


pro 


thr 


thr 


val 


arg 


leu 


arg 


gly ser 


thr 


met 


arg 


gly 


ser 


cys 


cys 


pro 


121/41 


















151/51 


















TGG TCG 


GAG 


TGC 


CAC 


CCG 


ACC 


AGG 


CCA 


CCG 


ACT TCG 


GCT 


CCG 


CTG 


TTG 


CAC 


CAG 


AAA 


CGA 


trp ser 


glu 


cys 


his 


pro 


thr 


arg 


pro 


pro 


thr ser 


ala 


pro 


leu 


leu 


his 


gin 


lys 


arg 


181/61 


















211/71 


















CGG CGC 


CGG 


TCT 


GGA 


TCA 


CCA 


TGC 


TGT 


GGC 


CGC TGG 


CCG 


ACC 


GGC 


CCC 


GGT 


TGG 


CCC 


CCG 


arg arg 


arg 


ser 


gly 


ser 


pro 


cys 


cys 


gly 


arg trp 


pro 


thr 


gly 


pro 


gly 


trp 


pro 


pro 


241/81 


















271/91 


















GGG CAC 


CCG 


GTG 


GCA 


CCG 


TTC 


CCG 


TCC 


GGC 


TGG TCG 


ACG 


ACG 


ACC 


TGG 


CAA 


ACT 


CGC 


TGG 


gly his 


pro 


val 


ala 


pro 


phe 


pro 


ser 


gly 


trp ser 


thr 


thr 


thr 


trp 


gin 


thr 


arg 


trp 


301/101 


















331/111 


















CCA ACG 


GCG 


GCC 


GGC 


TGG 


ACA 


TCC 


TCC 


TGT 


CGG CGG 


CCG 


AGT 


TCG 


CCA 


CCA 


ACC 


GGG 


AAG 


pro thr 


ala 


ala 


gly 


trp 


thr 


ser 


ser 


cys 


arg arg 


pro 


ser 


ser 


pro 


pro 


thr 


gly 


lys 


361/121 


















391/131 


















TCG ACC 


CCG 


ACG 


GCG 


CCG 


TCG 


GCC 


GAG 


CGC 


TGT GCC 


TGG 


CCA 


TCG 


ACC 


CAG 


ATC 






ser thr 


pro 


thr 


ala 


pro 


ser 


ala 


glu 


arg 


cys ala 


trp 


pro 


ser 


thr 


gin 


ile) 





SEQ ID NOS: 614-615 



FIG. 36B 




1/1 ^-SEQ ID NO: 616 31/11 



CCT GGG 


ACG 


AAG 


ACG 


ACG 


GCA 


GCA 


GCC 


GCA 


ATC AGA 


TCT 


ACC 


CGG 


TCC 


TGG 


TCA 


ACG 


TCA 


(pro gly 


thr 


lys 


thr 


thr 


ala 


ala 


ala 


ala 


ile arg 


ser 


thr 


arg 


ser 


trp 


ser 


thr 


ser 


61/21^" 


SEQ 


ZD NO: 617 










91/31 


















ATG GAC 


ACC 


CGA 


CTA 


CGG 


TGC 


GCC 


TGC 


GCG 


GCT CGA 


CAA 


TGC 


GCG 


GTT 


CCT 


GTT 


GCC 


CGT 


met asp 


thr 


arg 


leu 


arg 


cys 


ala 


cys 


ala 


ala arg 


gin 


cys 


ala 


val 


pro 


val 


ala 


arg 


121/41 


















151/51 


















GGT CGG 


AGT 


GCC 


ACC 


CGA 


CCA 


GGC 


CAC 


CGA 


CTT CGG 


CTC 


CGC 


TGT 


TGC 


ACC 


AGA 


AAC 


GAC 


gly arg 


ser 


ala 


thr 


arg 


pro 


gly 


his 


arg 


leu arg 


leu 


arg 


cys 


cys 


thr 


arg 


asn 


asp 


181/61 


















211/71 


















GGC GCC 


GGT 


CTG 


GAT 


CAC 


CAT 


GCT 


GTG 


GCC 


GCT GGC 


CGA 


CCG 


GCC 


CCG 


GTT 


GGC 


CCC 


CGG 


gly ala 


gly 


leu 


asp 


his 


his 


ala 


val 


ala 


ala gly 


arg 


pro 


ala 


pro 


val 


gly 


pro 


arg 


241/81 


















271/91 


















GGC ACC 


CGG 


TGG 


CAC 


CGT 


TCC 


CGT 


CCG 


GCT 


GGT CGA 


CGA 


CGA 


CCT 


GGC 


AAA 


CTC 


GCT 


GGC 


gly thr 


arg 


trp 


his 


arg 


ser 


arg 


pro 


ala 


gly arg 


arg 


arg 


pro 


gly 


lys 


leu 


ala 


gly 


301/101 


















331/111 


















CAA CGG 


CGG 


CCG 


GCT 


GGA 


CAT 


CCT 


CCT 


GTC 


GGC GGC 


CGA 


GTT 


CGC 


CAC 


CAA 


CCG 


GGA 


AGT 


gin arg 


arg 


pro 


ala 


gly 


his 


pro 


pro 


val 


gly gly 


arg 


val 


arg 


his 


gin 


pro 


gly 


ser 


361/121 


















391/131 


















CGA CCC 


CGA 


CGG 


CGC 


CGT 


CGG 


CCG 


AGC 


GCT 


GTG CCT 


GGC 


CAT 


CGA 


CCC 


AGA 


TC 






arg pro 


arg 


arg 


arg 


arg 


arg 


pro 


ser 


ala 


val pro 


gly 


his 


arg 


pro 


arg) 







SEQ ID NOS: 616-617 

FIG. 36C 



Coding sequence Rv3909 predicted by Cole et al . , 1998 (Nature 393 
537-544) containing Seq 36A 



1/1 _^-SEQ ID NO: 618 










31/11 


















GTG ACC 


GCA 


CTG 


CAA 


CTC 


GGC 


TGG 


GCC 


GCT 


TTG GCG 


CGC 


GTC 


ACC 


TCA 


GCG 


ATC 


GGC 


GTC 


(met thr 


ala 


leu 


gin 


leu 


gly 


trp 


ala 


ala 


leu ala 


arg 


val 


thr 


ser 


ala 


ile 


gly 


val 


61/21^- 


-SEQ 


ZD NO: 619 










91/31 


















GTG GCC 


GGC 


CTC 


GGG 


ATG 


GCG 


CTC 


ACG 


GTA 


CCG TCG 


GCG 


GCA 


CCG 


CAC 


GCG 


CTC 


GCA 


GGC 


val ala 


gly 


leu 


gly 


met 


ala 


leu 


thr 


val 


pro ser 


ala 


ala 


pro 


his 


ala 


leu 


ala 


gly 


121/41 


















151/51 


















GAG CCC 


AGC 


CCG 


ACG 


CCT 


TTT 


GTC 


CAG 


GTC 


CGC ATC 


GAT 


CAG 


GTG 


ACC 


CCG 


GAC 


GTG 


GTG 


glu pro 


ser 


pro 


thr 


pro 


phe 


val 


gin 


val 


arg ile 


asp 


gin 


val 


thr 


pro 


asp 


val 


val 


181/61 


















211/71 


















ACC ACT 


TCC 


AGC 


GAA 


CCC 


CAT 


GTC 


ACC 


GTC 


AGC GGA 


ACG 


GTG 


ACC 


AAT 


ACC 


GGT 


GAC 


CGC 


thr thr 


ser 


ser 


glu 


pro 


his 


val 


thr 


val 


ser gly 


thr 


val 


thr 


asn 


thr 


gly 


asp 


arg 


241/81 


















271/91 


















CCA GTC 


CGC 


GAT 


GTG 


ATG 


GTC 


CGG 


CTT 


GAG 


CAC GCC 


GCC 


GCG 


GTC 


ACG 


TCG 


TCA 


ACG 


GCG 


pro val 


arg 


asp 


val 


met 


val 


arg 


leu 


glu 


his ala 


ala 


ala 


val 


thr 


ser 


ser 


thr 


ala 


301/101 


















331/111 


















TTA CGC 


ACC 


TCG 


CTC 


GAC 


GGC 


GGC 


ACC 


GAC 


CAG TAC 


CAG 


CCG 


GCC 


GCG 


GAC 


TTC 


CTC 


ACG 


leu arg 


thr 


ser 


leu 


asp 


gly 


gly 


thr 


asp 


gin tyr 


gin 


pro 


ala 


ala 


asp 


phe 


leu 


thr 



SEQ ID NOS: 618-619 

FIG. 36D 

i 



361/121 

GTC GCC CCC GAA CTA GAC CGC GGG CAA GAG 
val ala pro glu leu asp arg gly gin glu 
421/141 

TCG CTG ACC AGG CCG TCG TTG GCC GTC AAC 
ser leu thr arg pro ser leu ala val asn 
481/161 

GTC AAT GGG ACA CCC GAC TAC GGT GCG CCT 
val asn gly thr pro asp tyr gly ala pro 
541/181 

CCC GTG GTC GGA GTG CCA CCC GAC CAG GCC 
pro val val gly val pro pro asp gin ala 
601/201 

ACG ACG GCG CCG GTC TGG ATC ACC ATG CTG 
thr thr ala pro val trp ile thr met leu 
661/221 

CCC GGG GCA CCC GGT GGC ACC GTT CCC GTC 
pro gly ala pro gly gly thr val pro val 
721/241 

CTG GCC AAC GGC GGC CGG CTG GAC ATC CTC 
leu ala asn gly gly arg leu asp ile leu 
781/261 

GAA GTC GAC CCC GAC GGC GCC GTC GGC CGA 
glu val asp pro asp gly ala val gly arg 
841/281 

CTC ATC ACC GTC AAT GCG ATG ACC GGC GGC 
leu ile thr val asn ala met thr gly gly 
901/301 

GCT CAA CTA CCG GGC ACC CCG ACC CAC CCG 
ala gin leu pro gly thr pro thr his pro 
961/321 

CTG GAT CGA TTG CGG ACG CTA GTC CAC CGG 
leu asp arg leu arg thr leu val his arg 
1021/341 

GCC GAC CTG GAT GCT TTG CAG CGG GTT AAT 
ala asp leu asp ala leu gin arg val asn 
1081/361 

AGC CCC GCC GAC ATC GTC GAC CGC ATC CTG 
ser pro ala asp ile val asp arg ile leu 
1141/381 

CTG CCC GAC GGC CCG TTG ACC GGC CGG GCG 
leu pro asp gly pro leu thr gly arg ala 
1201/401 

GTT GCC GTC GCG GCC GCC GAT TTT AGC CCC 
val ala val ala ala ala asp phe ser pro 
1261/421 

TCC GCG CTC TTA CCC GCT ACC GCG CCC CGG 
ser ala leu leu pro ala thr ala pro arg 
1321/441 

TTT GAT CCC GCG GTC GGG GCC GCG CTG GCC 
phe asp pro ala val gly ala ala leu ala 
1381/461 

TAT CTA GAT CCC TCG TTG TTC GTT CGG ATC 
tyr leu asp pro ser leu phe val arg ile 



391/131 

GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC 
ala gly phe thr leu ser ala pro leu arg 
451/151 

CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC 
gin pro gly ile tyr pro val leu val asn 
511/171 

GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG 
ala arg leu asp asn ala arg phe leu leu 
571/191 

ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA 
thr asp phe gly ser ala val ala pro glu 
631/211 

TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC 
trp pro leu ala asp arg pro arg leu ala 
691/231 

CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG 
arg leu val asp asp asp leu ala asn ser 
751/251 

CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG 
leu ser ala ala glu phe ala thr asn arg 
811/271 

GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA 
ala leu cys leu ala ile asp pro asp leu 
871/291 

TAC GTC GTG TCC GAC TCG CCC GAC GGG GCC 
tyr val val ser asp ser pro asp gly ala 
931/311 

GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG 
gly thr gly gin ala ala ala ser ser trp 
991/331 

ACA TGC GTG ACG CCG CTG CCT TTT GCC CAA 
thr cys val thr pro leu pro phe ala gin 
1051/351 

GAT CCG AGG CTG AGC GCG ATC GCA ACC ATC 
asp pro arg leu ser ala ile ala thr ile 
1111/371 

GAT GTC AGC TCC ACC CGC GGC GCA ACC GTG 
asp val ser ser thr arg gly ala thr val 
1171/391 

ATC AAC TTG CTC AGC ACC CAC GGC AAC ACG 
ile, asn leu leu ser thr his gly asn thr 
1231/411 

GAG GAA CAG CAG GGT TCG TCC CAG ATC GGC 
glu glu gin gin gly ser ser gin ile gly 
1291/431 

CGG TTG TCC CCG CGG GTG GTA GCG GCG CCG 
arg leu ser pro arg val val ala ala pro 
1351/451 

GCC GCG GGA ACA AAC CCG ACC GTT CCT ACC 
ala ala gly thr asn pro thr val pro thr 
1411/471 

GCG CAT GAA TCG ATC ACC GCG CGC CGC CAG 
ala his glu ser ile thr ala arg arg gin 



SEQ ZD NOS: 618-619 (continued 1) 



FIG. 36D (continued 1) 



it 



1441/481 

GAC GCC TTG GGC GCA ATG CTG TGG CGC AGC 
asp ala leu gly ala met leu trp arg ser 
1501/501 

ATC CTG GTG CCG CCG GCG TCG TGG AGC CTG 
ile leu val pro pro ala ser trp ser leu 
1561/521 

GCG CTG GCC ACC GCC ATC CGG TCT GGC CTG 
ala leu ala thr ala ile arg ser gly leu 
1621/541 

GCT GAC GCC GCG GCC CGC ACC GAG CCA CCG 
ala asp ala ala ala arg thr glu pro pro 
1681/561 

GGC CGG TTC AAT GAC GAC ATC ACC ACG CAG 
gly arg phe asn asp asp ile thr thr gin 
1741/581 

CTG ACC TCG GCG TTG ACC ATC GAT GAC CGC 
leu thr ser ala leu thr ile asp asp arg 
1801/601 

CCA CTA CGC GAG GAC ATG TTG CGC GCG CTG 
pro leu arg glu asp met leu arg ala leu 
1861/621 

GGG CTG GCC CAG CAG CGG CTG GCC GTC GTT 
gly leu ala gin gin arg leu ala val val 
1921/641 

GTG ACC ATC GTC AAC CCG GGC GGC TCC TAC 
val thr ile val asn pro gly gly ser tyr 
1981/661 

TTG GCG CTG CAT AAT GGC CTC GCC GTG CCA 
leu ala leu his asn gly leu ala val pro 
2041/681 

CCC GGG ATG ACG GTG GCC GAT GTC GGT CAG 
pro gly met thr val ala asp val gly gin 
2101/701 

CGA GTA CCA ATC GAG GTG AAC TTC AC A CAG 
arg val pro ile glu val asn phe thr gin 
2161/721 

CCC GAC GGC GTC GCG CTG GGT GAA CCG GTG 
pro asp gly val ala leu gly glu pro val 
2221/741 

AAG GTG TTG TTC GCG ATC ACG CTA TCC GCT 
lys val leu phe ala ile thr leu ser ala 
2281/761 

CGC CTT TGG CAC CGG TTC CGT GGC CAG CCT 
arg leu trp his arg phe arg gly gin pro 
2341/781 

CCT ACC GGC AAA CAC GCC CCG CAG CGC CGT 
pro thr gly lys his ala pro gin arg arg 
2401/801 
CGG GTA TGA 
arg vaDOPA 



1471/491 

TTG GAG CCG AAT GCC GCG CCC CGT ACC CAA 
leu glu pro asn ala ala pro arg thr gin 
1531/511 

GCC AGC GAC GAC GCG CAG GTC ATC CTG ACC 
ala ser asp asp ala gin val ile leu thr 
1591/531 

GCC GTG CCG CGA CCA CTA CCG GCG GTG ATC 
ala val pro arg pro leu pro ala val ile 
1651/551 

GAA CCC CCG GGC GCT TAC AGC GCC GCT CGC 
glu pro pro gly ala tyr ser ala ala arg 
1711/571 

ATC GGC GGG CAG GTT GCC CGG CTA TGG AAG 
ile gly gly gin val ala arg leu trp lys 
1771/591 

ACC GGG CTG ACC GGC GTG CAG TAC ACC GCA 
thr gly leu thr gly val gin tyr thr ala 
1831/611 

AGC CAA TCG CTA CCA CCC GAT ACC CGC AAC 
ser gin ser leu pro pro asp thr arg asn 
1891/631 

GGA AAG ACG ATC GAC GAT CTT TTC GGC GCG 
gly lys thr ile asp asp leu phe gly ala 
1951/651 

ACT CTG GCC ACC GAG CAC AGT CCG CTG CCG 
thr leu ala thr glu his ser pro leu pro 
2011/671 

ATC CGG GTC CGG CTA CAG GTC GAT GCT CCG 
ile arg val arg leu gin val asp ala pro 
2071/691 

ATC GAG CTA CCG CCC GGG TAC CTG CCG CTA 
ile glu leu pro pro gly tyr leu pro leu 
2131/711 

CGG GTT GCC GTC GAC GTG TCG CTG CGG ACC 
arg val ala val asp val ser leu arg thr 
2191/731 

CGG TTG TCG GTG CAC TCC AAC GCC TAC GGC 
arg leu ser val his ser asn ala tyr gly 
2251/751 

GCG GCC GTG CTG GTA ACG CTG GCG GGC CGG 
ala ala val leu val thr leu ala gly arg 
2311/771 

GAT CGC GCC GAC CTG GAT CGC CCC GAC CTG 
asp arg ala asp leu asp arg pro asp leu 
2371/791 

GCC GTA GCC AGT CGG GAT GAC GAA AAG CAC 
ala val ala ser arg asp asp glu lys his 



SEQ ID NOS: 618-619 (continued 2) 

FIG. 36D (continu d 2) 




ORF according to Cole et al . , 1998 (Nature 393 537-544) and containing Rv 3909. 
1/1 _^-SEQ ID NO: 620 31/11 

TGA CTC AGC ACC GGG TCA GCA CAA CGG TCC CGG GCC GGG GCC GTG ACC GCA CTG CAA CTC 
OPA(leu ser thr gly ser ala gin arg ser arg ala gly ala val thr ala leu gin leu 
61/2r*^-SEQ ID NO: 621 91/31 

GGC TGG GCC GCT TTG GCG CGC GTC ACC TCA GCG ATC GGC GTC GTG GCC GGC CTC GGG ATG 
gly trp ala ala leu ala arg val thr ser ala ile gly val val ala gly leu gly met 
121/41 151/51 

GCG CTC ACG GTA CCG TCG GCG GCA CCG CAC GCG CTC GCA GGC GAG CCC AGC CCG ACG CCT 
ala leu thr val pro ser ala ala pro his ala leu ala gly glu pro ser pro thr pro 
181/61 211/71 

TTT GTC CAG GTC CGC ATC GAT CAG GTG ACC CCG GAC GTG GTG ACC ACT TCC AGC GAA CCC 
phe val gin val arg ile asp gin val thr pro asp val val thr thr ser ser glu pro 
241/81 271/91 

CAT GTC ACC GTC AGC GGA ACG GTG ACC AAT ACC GGT GAC CGC CCA GTC CGC GAT GTG ATG 
his val thr val ser gly thr val thr asn thr gly asp arg pro val arg asp val met 
301/101 331/111 

GTC CGG CTT GAG CAC GCC GCC GCG GTC ACG TCG TCA ACG GCG TTA CGC ACC TCG CTC GAC 
val arg leu glu his ala ala ala val thr ser ser thr ala leu arg thr ser leu asp 
361/121 391/131 

GGC GGC ACC GAC CAG TAC CAG CCG GCC GCG GAC TTC CTC ACG GTC GCC CCC GAA CTA GAC 
gly gly thr asp gin tyr gin pro ala ala asp phe leu thr val ala pro glu leu asp 
421/141 451/151 

CGC GGG CAA GAG GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC TCG CTG ACC AGG CCG TCG 
arg gly gin glu ala gly phe thr leu ser ala pro leu arg ser leu thr arg pro ser 
481/161 511/171 

TTG GCC GTC AAC CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC GTC AAT GGG ACA CCC GAC 
leu ala val asn gin pro gly ile tyr pro val leu val asn val asn gly thr pro asp 
541/181 571/191 

TAC GGT GCG CCT GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG CCC GTG GTC GGA GTG CCA 
tyr gly ala pro ala arg leu asp asn ala arg phe leu leu pro val val gly val pro 
601/201 631/211 

CCC GAC CAG GCC ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA ACG ACG GCG CCG GTC TGG 
pro asp gin ala thr asp phe gly ser ala val ala pro glu thr thr ala pro val trp 
661/221 691/231 

ATC ACC ATG CTG TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC CCC GGG GCA CCC GGT GGC 
ile thr met leu trp pro leu ala asp arg pro arg leu ala pro gly ala pro gly gly 
721/241 751/251 

ACC GTT CCC GTC CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG CTG GCC AAC GGC GGC CGG 
thr val pro val arg leu val asp asp asp leu ala asn ser leu ala asn gly gly arg 
781/261 811/271 

CTG GAC ATC CTC CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG GAA GTC GAC CCC GAC GGC 
leu asp ile leu leu ser ala ala glu phe ala thr asn arg glu val asp pro asp gly 
841/281 871/291 

GCC GTC GGC CGA GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA CTC ATC ACC GTC AAT GCG 
ala val gly arg ala leu cys leu ala ile asp pro asp leu leu ile thr val asn ala 
901/301 931/311 

ATG ACC GGC GGC TAC GTC GTG TCC GAC TCG CCC GAC GGG GCC GCT CAA CTA CCG GGC ACC 
met thr gly gly tyr val val ser asp ser pro asp gly ala ala gin leu pro gly thr 
961/321 991/331 

CCG ACC CAC CCG GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG CTG GAT CGA TTG CGG ACG 
pro thr his pro gly thr gly gin ala ala ala ser ser trp leu asp arg leu arg thr 



SEQ ID NOS: 620-621 



FIG. 36E 



1021/341 

CTA GTC CAC CGG ACA TGC GTG 
leu val his arg thr cys val 
1081/361 

CAG CGG GTT AAT GAT CCG AGG 
gin arg val asn asp pro arg 
1141/381 

GAC CGC ATC CTG GAT GTC AGC 
asp arg ile leu asp val ser 
1201/401 

ACC GGC CGG GCG ATC AAC TTG 
thr gly arg ala ile asn leu 
1261/421 

GAT TTT AGC CCC GAG GAA CAG 
asp phe ser pro glu glu gin 
1321/441 

ACC GCG CCC CGG CGG TTG TCC 
thr ala pro arg arg leu ser 
1381/461 

GCC GCG CTG GCC GCC GCG GGA 
ala ala leu ala ala ala gly 
1441/481 

TTC GTT CGG ATC GCG CAT GAA 
phe val arg ile ala his glu 
1501/501 

CTG TGG CGC AGC TTG GAG CCG 
leu trp arg ser leu glu pro 
1561/521 

TCG TGG AGC CTG GCC AGC GAC 
ser trp ser leu ala ser asp 
1621/541 

CGG TCT GGC CTG GCC GTG CCG 
arg ser gly leu ala val pro 
1681/561 

ACC GAG CCA CCG GAA CCC CCG 
thr glu pro pro glu pro pro 
1741/581 

ATC ACC ACG CAG ATC GGC GGG 
ile thr thr gin ile gly gly 
1801/601 

ATC GAT GAC CGC ACC GGG CTG 
ile asp asp arg thr gly leu 
1861/621 

TTG CGC GCG CTG AGC CAA TCG 
leu arg ala leu ser gin ser 
1921/641 

CTG GCC GTC GTT GGA AAG ACG 
leu ala val val gly lys thr 
1981/661 

GGC GGC TCC TAG ACT CTG GCC 
gly gly ser tyr thr leu ala 



1051/351 
ACG CCG CTG CCT TTT GCC 
thr pro leu pro phe ala 

1111/371 
CTG AGC GCG ATC GCA ACC 
leu ser ala ile ala thr 

1171/391 
TCC ACC CGC GGC GCA ACC 
ser thr arg gly ala thr 

1231/411 
CTC AGC ACC CAC GGC AAC 
leu ser thr his gly asn 

1291/431 
CAG GGT TCG TCC CAG ATC 
gin gly ser ser gin ile 

1351/451 
CCG CGG GTG GTA GCG GCG 
pro arg val val ala ala 

1411/471 
ACA AAC CCG ACC GTT CCT 
thr asn pro thr val pro 

1471/491 
TCG ATC ACC GCG CGC CGC 
ser ile thr ala arg arg 

1531/511 
AAT GCC GCG CCC CGT ACC 
asn ala ala pro arg thr 

1591/531 
GAC GCG CAG GTC ATC CTG 
asp ala gin val ile leu 

1651/551 
CGA CCA CTA CCG GCG GTG 
arg pro leu pro ala val 

1711/571 
GGC GCT TAC AGC GCC GCT 
gly ala tyr ser ala ala 

1771/591 
CAG GTT GCC CGG CTA TGG 
gin val ala arg leu trp 

1831/611 
ACC GGC GTG CAG TAC ACC 
thr gly val gin tyr thr 

1891/631 
CTA CCA CCC GAT ACC CGC 
leu pro pro asp thr arg 

1951/651 
ATC GAC GAT CTT TTC GGC 
ile asp asp leu phe gly 

2011/671 
ACC GAG CAC AGT CCG CTG 
thr glu his ser pro leu 



CAA GCC GAC CTG GAT GCT TTG 
gin ala asp leu asp ala leu 

ATC AGC CCC GCC GAC ATC GTC 
ile ser pro ala asp ile val 

GTG CTG CCC GAC GGC CCG TTG 
val leu pro asp gly pro leu 

ACG GTT GCC GTC GCG GCC GCC 
thr val ala val ala ala ala 

GGC TCC GCG CTC TTA CCC GCT 
gly ser ala leu leu pro ala 

CCG TTT GAT CCC GCG GTC GGG 
pro phe asp pro ala val gly 

ACC TAT CTA GAT CCC TCG TTG 
thr tyr leu asp pro ser leu 

CAG GAC GCC TTG GGC GCA ATG 
gin asp ala leu gly ala met 

CAA ATC CTG GTG CCG CCG GCG 
gin ile leu val pro pro ala 

ACC GCG CTG GCC ACC GCC ATC 
thr ala leu ala thr ala ile 

ATC GCT GAC GCC GCG GCC CGC 
ile ala asp ala ala ala arg 

CGC GGC CGG TTC AAT GAC GAC 
arg gly arg phe asn asp asp 

AAG CTG ACC TCG GCG TTG ACC 
lys leu thr ser ala leu thr 

GCA CCA CTA CGC GAG GAC ATG 
ala pro leu arg glu asp met 

AAC GGG CTG GCC CAG CAG CGG 
asn gly leu ala gin gin arg 

GCG GTG ACC ATC GTC AAC CCG 
ala val thr ile val asn pro 

CCG TTG GCG CTG CAT AAT GGC 
pro leu ala leu his asn gly 
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2 6 2003 S? 



2041/681 2071/691 



CTC GCC GTG 


CCA 


ATC 


CGG 


GTC 


CGG 


CTA 


CAG 


GTC GAT GCT 


CCG 


CCC 


GGG 


ATG 


ACG 


GTG 


GCC 


leu ala val 


pro 


ile 


arg 


val 


arg 


leu 


gin 


val asp ala 


pro 


pro 


gly 


met 


thr 


val 


ala 


2101/701 
















2131/711 
















GAT GTC GGT 


CAG 


ATC 


GAG 


CTA 


CCG 


CCC 


GGG 


TAC CTG CCG 


CTA 


CGA 


GTA 


CCA 


ATC 


GAG 


GTG 


asp val gly 


gin 


ile 


glu 


leu 


pro 


pro 


gly 


tyr leu pro 


leu 


arg 


val 


pro 


ile 


glu 


val 


2161/721 
















2191/731 
















AAC TTC ACA 


CAG 


CGG 


GTT 


GCC 


GTC 


GAC 


GTG 


TCG CTG CGG 


ACC 


CCC 


GAC 


GGC 


GTC 


GCG 


CTG 


asn phe thr 


gin 


arg 


val 


ala 


val 


asp 


val 


ser leu arg 


thr 


pro 


asp 


gly 


val 


ala 


leu 


2221/741 
















2251/751 
















GGT.GAA CCG 


GTG 


CGG 


TTG 


TCG 


GTG 


CAC 


TCC 


AAC GCC TAC 


GGC 


AAG 


GTG 


TTG 


TTC 


GCG 


ATC 


gly glu pro 


val 


arg 


leu 


ser 


val 


his 


ser 


asn ala tyr 


gly 


lys 


val 


leu 


phe 


ala 


ile 


2281/761 
















2311/771 
















ACG CTA TCC 


GCT 


GCG 


GCC 


GTG 


CTG 


GTA 


ACG 


CTG GCG GGC 


CGG 


CGC 


CTT 


TGG 


CAC 


CGG 


TTC 


thr leu ser 


ala 


ala 


ala 


val 


leu 


val 


thr 


leu ala gly 


arg 


arg 


leu 


trp 


his 


arg 


phe 


2341/781 
















2371/791 
















CGT GGC CAG 


CCT 


GAT 


CGC 


GCC 


GAC 


CTG 


GAT 


CGC CCC GAC 


CTG 


CCT 


ACC 


GGC 


AAA 


CAC 


GCC 


arg gly gin 


pro 


asp 


arg 


ala 


asp 


leu 


asp 


arg pro asp 


leu 


pro 


thr 


gly 


lys 


his 


ala 


2401/801 
















2431/811 
















CCG CAG CGC 


CGT 


GCC 


GTA 


GCC 


AGT 


CGG 


GAT 


GAC GAA AAG 


CAC 


CGG 


GTA 


TGA 








pro gin arg 


arg 


ala 


val 


ala 


ser 


arg 


asp 


asp glu lys 


his 


arg 


val)OPA 









SEQ XD NOS: 620-621 (continued 2) 

FIG. 36E ( cont inued 2 ) 



1/1 ^ -SEQ ID NOs 622 

ATC CGC GCG TTG GCG TCG CAT CCG AAC ATC 
( ile a rg ala leu ala ser his pro asn ile 
61/21^- SEQ ID NO: 623 

CAC AGC GGC GCC CAA ATC ATG GCC GAC ACC 
his ser gly ala gin ile met ala asp thr 
121/41 

CTC AAC CTG CCC TGG CTG GCC ATG GGC GCC 
leu asn leu pro trp leu ala met gly ala 
181/61 

GCA GCC GGG CAG CTT CGA GAG TTG TTG TCC 

ala ala gly gin leu arg glu leu leu ser 

241/81 

CGC AAG ATC 

arg lys ile) 



31/11 

GTC GGA GTC AAG GAC GCC AAA GCC GAC CTG 
val gly val lys asp ala lys ala asp leu 
91/31 

GGA CTG GCC TAC TAT TCC GGC GAC GAC GCG 
gly leu ala tyr tyr ser gly asp asp ala 
151/51 

ACG GGC TTC ATC AGC GTG ATT GCC CAC CTG 
thr gly phe ile ser val ile ala his leu 
211/71 

GCC TTC GGT TCT GGG GAT ATC GCC ACC GCC 
ala phe gly ser gly asp ile ala thr ala 



SEQ ID NOS: 622-623 



FIG. 37A 



1/1 SEQ ID NO: 624 31/11 





CGC 


GTT 


GGC 


GTC 


GCA 


ILL 


GAA 


CAT 


CGT CGG 


AGT 


CAA 


GGA 


CGC 


CAA 


AGC 


CGA 


CCT 


(asp pro 


arg 


val 


gly 


val 


ala 


ser 


glu 


his 


arg arg 


ser 


gin 


gly 


arg 


gin 


ser 


arg 


pro 


61/21 


- — SEQ ID NO: 


: 625 








91/31 


















GCA C AG 


CGG 


CGC 


CCA 


AAT 


CAT 


GGC 


CGA ' 


CAC 


CGG ACT 


GGC 


CTA 


CTA 


TTC 


CGG 


CGA 


CGA 


CGC 


ala gin 


arg 


arg 


pro 


asn 


his 


gly 


arg 


his 


arg thr 


gly 


leu 


leu 


phe 


arg 


arg 


arg 


arg 


121/41 


















151/51 


















GCT CAA 


CCT 


GCC 


CTG 


GCT 


GGC 


CAT 


GGG 


CGC 


CAC GGG 


CTT 


CAT 


CAG 


CGT 


GAT 


TGC 


CCA 


CCT 


ala gin 


pro 


ala 


leu 


ala 


gly 


his 


gly 


arg 


his gly 


leu 


his 


gin 


arg 


asp 


cys 


pro 


pro 


181/61 


















211/71 


















GGC AGC 


CGG 


GCA 


GCT 


TCG 


AGA 


GTT 


GTT 


GTC 


CGC CTT 


CGG 


TTC 


TGG 


GGA 


TAT 


CGC 


CAC 


CGC 


gly ser 


arg 


ala 


ala 


ser 


arg 


val 


val 


val 


arg leu 


arg 


phe 


trp 


gly 


tyr 


arg 


his 


arg 


241/81 




































CCG CAA 


GAT 


C 
































pro gin 


asp) 














































SEQ ID NOS:624- 


625 






























FIG. 37B 


















1/1^ — SEQ ID NO: 626 










31/11 


















TCC GCG 


CGT 


TGG 


CGT 


CGC 


ATC 


CGA 


ACA 


TCG 


TCG GAG 




AGG 


ACG 


CCA 


AAG 


CCG 


ACC 


TGC 


(ser ala 


arg 


trp 


arg 


arg 


ile 


arg 


thr 


ser 


Ser glu 


ser 


arg 


thr 


pro 


lys 


pro 


thr 


cys 


61/21 ' 


- SEQ ID NO: 627 












91/31 












ACA GCG 


GCG 


CCC 


AAA 


TCA 


TGG 


CCG 


ACA 


CCG 


GAC TGG 


CCT 


ACT 


ATT 


CCG 


GCG 


ACG 


ACG 


CGC 


thr ala 


ala 


pro 


lys 


ser 


trp 


pro 


thr 


pro 


asp trp 


pro 


thr 


ile 


pro 


ala 


thr 


thr 


arg 


121/41 


















151/51 


















TCA ACC 


TGC 


CCT 


GGC 


TGG 


CCA 


TGG 


GCG 


CCA 


CGG GCT 


TCA 


TCA 


GCG 


TGA 


TTG 


CCC 


ACC 


TGG 


ser thr 


cys 


pro 


gly 


trp 


pro 


trp 


ala 


pro 


arg ala 


ser 


ser 


ala)OPA(leu 


pro 


thr 


trp 


181/61 


















211/71 


SEQ 


ID NO: 628^ 










CAG CCG 


GGC 


AGC 


TTC 


GAG 


AGT 


TGT 


TGT 


CCG 


CCT TCG 


GTT 


CTG 


GGG 


ATA 


TCG 


CCA 


CCG 


CCC 


gin pro 


gly 


ser 


phe 


glu 


ser 


cys 




pro 


pro ser 


val 


leu 


gly 


ile 


ser 


pro 


pro 


pro 



241/81 
GCA AGA TC 
ala arg) 



SEQ ID NOS: 626-628 



FIG. 37C 




Coding sequence Rv2753c predicted by Cole et al . , 1998 (Nature 393 
537-544) containing Seq 37A 



1/1. SEQ ID NO: 629 










31/11 


















GTG ACC 


ACC 


GTC 


GGA 


TTC 


GAC 


GTC 


GCA 


GCG 


CGC CTA 


GGA 


ACC 


CTG 


CTG 


ACC 


GCG 


ATG 


GTG 


[val thr 


thr 


val 


gly 


phe 


asp 


val 


ala 


ala 


arg leu 


gly 


thr 


leu 


leu 


thr 


ala 


met 


val 


61/21^- 


SEQ 


ZD NO: 630 










91/31 


















ACA CCG 


TTT 


AGC 


GGC 


GAT 


GGC 


TCC 


CTG 


GAC 


ACC GCC 


ACC 


GCG 


GCG 


CGG 


CTG 


GCC 


AAC 


CAC 


thr pro 


phe 


ser 


gly 


asp 


gly 


ser 


leu 


asp 


thr ala 


thr 


ala 


ala 


arg 


leu 


ala 


asn 


his 


121/41 


















151/51 


















CTG GTC 


GAT 


CAG 


GGG 


TGC 


GAC 


GGT 


CTG 


GTG 


GTC TCG 


GGC 


ACC 


ACC 


GGC 


GAG 


TCG 


CCG 


ACC 


leu val 


asp 


gin 


gly 


cys 


asp 


gly 


leu 


val 


val ser 


gly 


thr 


thr 


gly 


glu 


ser 


pro 


thr 


181/61 


















211/71 


















ACC ACC 


GAC 


GGG 


GAG 


AAA 


ATC 


GAG 


CTG 


CTG 


CGG GCC 


GTC 


TTG 


GAA 


GCG 


GTG 


GGG 


GAC 


CGG 


thr thr 


asp 


gly 


glu 


lys 


ile 


glu 


leu 


leu 


arg ala 


val 


leu 


glu 


ala 


val 


gly 


asp 


arg 


241/81 


















271/91 


















GCC CGT 


GTT 


ATC 


GCC 


GGT 


GCC 


GGC 


ACC 


TAT 


GAC ACC 


GCG 


CAC 


AGC 


ATC 


CGG 


CTG 


GCC 


AAG 


ala arg 


val 


ile 


ala 


gly 


ala 


gly 


thr 


tyr 


asp thr 


ala 


his 


ser 


ile 


arg 


leu 


ala 


lys 


301/101 


















331/111 


















GCT TGT 


GCG 


GCC 


GAG 


GGT 


GCG 


CAC 


GGG 


CTG 


CTG GTG 


GTC 


ACG 


CCC 


TAC 


TAT 


TCC 


AAG 


CCG 


ala cys 


ala 


ala 


glu 


gly 


ala 


his 


gly 


leu 


leu val 


val 


thr 


pro 


tyr 


tyr 


ser 


lys 


pro 


361/121 


















391/131 


















CCG CAG 


CGG 


GGG 


CTG 


CAA 


GCC 


CAT 


TTC 


ACC 


GCC GTC 


GCC 


GAC 


GCG 


ACC 


GAG 


CTG 


CCG 


ATG 


pro gin 


arg 


gly 


leu 


gin 


ala 


his 


phe 


thr 


ala val 


ala 


asp 


ala 


thr 


glu 


leu 


pro 


met 


421/141 


















451/151 


















CTG CTC 


TAT 


GAC 


ATC 


CCG 


GGG 


CGG 


TCG 


GCG 


GTG CCG 


ATC 


GAG 


CCC 


GAC 


ACG 


ATC 


CGC 


GCG 


leu leu 


tyr 


asp 


ile 


pro 


gly 


arg 


ser 


ala 


val pro 


ile 


glu 


pro 


asp 


thr 


ile 


arg 


ala 


481/161 


















511/171 


















TTG GCG 


TCG 


CAT 


CCG 


AAC 


ATC 


GTC 


GGA 


GTC 


AAG GAC 


GCC 


AAA 


GCC 


GAC 


CTG 


CAC 


AGC 


GGC 


leu ala 


ser 


his 


pro 


asn 


ile 


val 


gly 


val 


lys asp 


ala 


lys 


ala 


asp 


leu 


his 


ser 


gly 


541/181 


















571/191 


















GCC CAA 


ATC 


ATG 


GCC 


GAC 


ACC 


GGA 


CTG 


GCC 


TAC TAT 


TCC 


GGC 


GAC 


GAC 


GCG 


CTC 


AAC 


CTG 


ala gin 


ile 


met 


ala 


asp 


thr 


gly 


leu 


ala 


tyr tyr 


ser 


gly 


asp 


asp 


ala 


leu 


asn 


leu 


601/201 


















631/211 


















CCC TGG 


CTG 


GCC 


ATG 


GGC 


GCC 


ACG 


GGC 


TTC 


ATC AGC 


GTG 


ATT 


GCC 


CAC 


CTG 


GCA 


GCC 


GGG 


pro trp 


leu 


ala 


met 


gly 


ala 


thr 


gly 


phe 


ile ser 


val 


ile 


ala 


his 


leu 


ala 


ala 


gly 


661/221 


















691/231 


















CAG CTT 


CGA 


GAG 


TTG 


TTG 


TCC 


GCC 


TTC 


GGT 


TCT GGG 


GAT 


ATC 


GCC 


ACC 


GCC 


CGC 


AAG 


ATC 


gin leu 


arg 


glu 


leu 


leu 


ser 


ala 


phe 


gly 


ser gly 


asp 


ile 


ala 


thr 


ala 


arg 


lys 


ile 


721/241 


















751/251 


















AAC ATT 


GCG 


GTC 


GCC 


CCG 


CTG 


TGC 


AAC 


GCG 


ATG AGC 


CGC 


CTG 


GGT 


GGG 


GTG 


ACG 


TTG 


TCC 


asn ile 


ala 


val 


ala 


pro 


leu 


cys 


asn 


ala 


met ser 


arg 


leu 


gly 


gly 


val 


thr 


leu 


ser 


781/261 


















811/271 


















AAG GCG 


GGC 


TTG 


CGG 


CTG 


CAG 


GGC 


ATC 


GAC 


GTC GGT 


GAT 


CCC 


CGG 


CTG 


CCC 


CAG 


GTG 


GCC 


lys ala 


gly 


leu 


arg 


leu 


gin 


gly 


ile 


asp 


val gly 


asp 


pro 


arg 


leu 


pro 


gin 


val 


ala 


841/281 


















871/291 


















GCG ACA 


CCG 


GAG 


CAG 


ATC 


GAC 


GCG 


TTG 


GCC 


GCC GAC 


ATG 


CGC 


GCG 


GCC 


TCG 


GTG 


CTT 


CGG 


ala thr 


pro 


glu 


gin 


ile 


asp 


ala 


leu 


ala 


ala asp 


met 


arg 


ala 


ala 


ser 


val 


leu 


arg) 



901/301 

TGA 

OPA 



SEQ ID NOS: 629-630 



FIG. 37D 



ORF according to Cole et al . , 1998 (Nature 393 537-544) containing Rv2753c 



1/1^ — SEQ ID NO: 631 




TAA GGT 


GAG 


CGC 


pprp 
V_ Vj J. 






OCH (gly 


glu 


arg 


ox y 


y xy 


arg 


61 /2T^^ 


— SEQ ID 


E*\J m 


O 




V— X J. V— X A 


CICIC 






pp 

CGA 


CG 1 


J- LA ICLl 


gly 


pro 


cnr 


arg 


arg 


Xzi X / *± X 


















CCG 


GGC 


CTG 


a r g vs 1 


arg 


arg 


pro 


gly 


leu 


i ft i / i 

lOl / OX 














CGG 


CTA 


CCT 


CCG 


GCA 


gin arg 


arg 


leu 


pro 


pro 


ala 


0>l 1 / Q1 

z 411 / o X 












LAG CGT 


GTC 


GTT 


CTA 


CAT 


CAC 


gin arg 


val 


val 


leu 


his 


his 


•a n 1 / 1 n i 

jUI/ 1U1 












t~l_G GL-A 


TTT 


CTC 


CTA 


CTC 


GCA 


pro ala 


phe 


leu 


leu 


leu 


ala 


7^1 / 1 9 1 
O D JL / XZ JL 














G(-C 


GC_(_ 


CGG 


CAT 


GGA 


arg arg 


axa 


axa 


arg 


nis 


gly 


4.9 1 / 1 41 

4 ^ X / JL 4 X 














ppp 






CAC 


CTA 


arg arg 


arg 


pro 


arg 


his 


leu 


4ft 1 /i fii 

r± O X / X vJ X 


















OAT 1 


CC 1 




pro thr 


gin 


arg 


asp 


pro 


aia 


541/181 












PAA pop 


CAC 


CGA 


AAV-. 


CCG 




gin arg 


his 


arg 


L asn 


pro 


nis 


601 /201 

\J \J X. / £i \J A. 












GGC AAT 


GCG 


GGC 






ppa 


glx y ClSll 


ala 


y xy 


gin 


arg 


axa 


661/221 












GCG CCA 


GCT 








riric 


CL X GL X. V»J 


ala 


arg 


CI x y 


wig 


9"iy 


721/241 












PGA rnn 


CAC 


CGA 




cine 






his 


arg 


y xy 


rrl ■%./ 

y j-y 


asp 


781/261 












GGA CAA 


ACA 


CGC 


GCG 


CTC 


GCG 


gly gin 


thr 


arg 


ala 


leu 


ala 


841/281 












ACC GTC 


GGA 


TTC 


GAC 


GTC 


GCA 


thr val 


gly 


phe 


asp 


val 


ala 


901/301 












TTT AGC 


GGC 


GAT 


GGC 


TCC 


CTG 


phe ser 


gly 


asp 


gly 


ser 


leu 


961/321 












GAT CAG 


GGG 


TGC 


GAC 


GGT 


CTG 


asp gin 


gly 


cys 


asp 


gly 


leu 



31/11 

GAC CGC GCC GCT GCG CGT 
asp arg ala ala ala arg 

GCC CTG GAC CAC CGA CGC 
ala leu asp his arg arg 
151/51 

CTA TCA GAG CTG GTC CAA 
leu ser glu leu val gin 
211/71 

CAT CAT CGA CGT CGG ACA 
his his arg arg arg thr 

271/91 
CGG GAT CTC GCG ATC GTG 
arg asp leu ala ile val 

331/111 
GCT CTC CCA GCG CTA CGT 
ala leu pro ala leu arg 

391/131 
GGA CGA CGC CGA CCT GCG 
gly arg arg arg pro ala 

451/151 
CAG CGA GCT GCT GGC CAA 
gin arg ala ala gly gin 

511/171 
CCG CAA GCA GGC CCG CCA 
pro gin ala gly pro pro 

571/191 
CGT GGT GAC CGG CAA CTA 
arg gly asp arg gin leu 

631/211 
CGC CGA CGT GGA AAT CCG 
arg arg arg gly asn pro 

691/231 
CCC CGC GGT GTT CGC CGA 
pro arg gly val arg arg 

751/251 
CAG CCC GTT GGC GAC CGA 
gin pro val gly asp arg 

811/271 
GCC GGG ATA AAG CGC CAG 
ala gly ile lys arg gin 

871/291 
GCG CGC CTA GGA ACC CTG 
ala arg leu gly thr leu 

931/311 
GAC ACC GCC ACC GCG GCG 
asp thr ala thr ala ala 

991/331 
GTG GTC TCG GGC ACC ACC 
val val ser gly thr thr 



GCA ACT GAT CGC CAA GAC CGA 
ala thr asp arg gin asp arg 
91/31 

CGA CGG CGG ACC CGC GCT GGT 
arg arg arg thr arg ala gly 

GCC CAA TCC CAA GAC CGC CAC 
ala gin ser gin asp arg his 

TTT CTC GGT GCT AGA GCA TGC 
phe leu gly ala arg ala cys 

CAC CCA CGA GCT GAT CCG CCA 
his pro arg ala asp pro pro 

ACC CGA GAA GGA CTC GCG GGT 
thr arg glu gly leu ala gly 

CCA CAT CCT GAC CGA GGC CGC 
pro his pro asp arg gly arg 

GCT GGA AGC CAA GTT CGC CGA 
ala gly ser gin val arg arg 

AGC CGC CCG CGC GGT GCT GCC 
ser arg pro arg gly ala ala 

CCG GGC CTG GCG GCA CTT CAT 
pro gly leu ala ala leu his 

GCG ACT GGC CAT CGA ATG CCT 
ala thr gly his arg met pro 

CTT CGA GGT GAC CAC CCT GGC 
leu arg gly asp his pro gly 

AGC CTG AGG CGG CGT GTC GCT 
ser leu arg arg arg val ala 

GTA ACC TTG GGA GCC GTG ACC 
val thr leu gly ala val thr 

CTG ACC GCG ATG GTG ACA CCG 
leu thr ala met val thr pro 

CGG CTG GCC AAC CAC CTG GTC 
arg leu ala asn his leu val 

GGC GAG TCG CCG ACC ACC ACC 
gly glu ser pro thr thr thr 



SEQ ID NOS: 631-632 



FIG. 37E 



J? 



1021/341 
















1051/351 
















GAC GGG GAG 


AAA 


ATC 


GAG 


CTG 


CTG 


CGG 


GCC 


GTC TTG GAA 


GCG 


GTG 


GGG 


GAC 


CGG 


GCC 


CGT 


asp gly glu 


lys 


ile 


glu 


leu 


leu 


arg 


ala 


val leu glu 


ala 


val 


gly 


asp 


arg 


ala 


arg 


1081/361 
















1111/371 
















GTT ATC GCC 


GGT 


GCC 


GGC 


ACC 


TAT 


GAC 


ACC 


GCG CAC AGC 


ATC 


CGG 


CTG 


GCC 


AAG 


GCT 


TGT 


val ile ala 


gly 


ala 


gly 


thr 


tyr 


asp 


thr 


ala his ser 


ile 


arg 


leu 


ala 


lys 


ala 


cys 


1141/381 
















1171/391 
















GCG GCC GAG 


GGT 


GCG 


CAC 


GGG 


CTG 


CTG 


GTG 


GTC ACG CCC 


TAC 


TAT 


TCC 


AAG 


CCG 


CCG 


CAG 


ala ala glu 


gly 


ala 


his 


gly 


leu 


leu 


val 


val thr pro 


tyr 


tyr 


ser 


lys 


pro 


pro 


gin 


1201/401 
















1231/411 
















CGG GGG CTG 


CAA 


GCC 


CAT 


TTC 


ACC 


GCC 


GTC 


GCC GAC GCG 


ACC 


GAG 


CTG 


CCG 


ATG 


CTG 


CTC 


arg gly leu 


gin 


ala 


his 


phe 


thr 


ala 


val 


ala asp ala 


thr 


glu 


leu 


pro 


met 


leu 


leu 


1261/421 
















1291/431 
















TAT GAC ATC 


CCG 


GGG 


CGG 


TCG 


GCG 


GTG 


CCG 


ATC GAG CCC 


GAC 


ACG 


ATC 


CGC 


GCG 


TTG 


GCG 


tyr asp ile 


pro 


gly 


arg 


ser 


ala 


val 


pro 


ile glu pro 


asp 


thr 


ile 


arg 


ala 


leu 


ala 


1321/441 
















1351/451 
















TCG CAT CCG 


AAC 


ATC 


GTC 


GGA 


GTC 


AAG 


GAC 


GCC AAA GCC 


GAC 


CTG 


CAC 


AGC 


GGC 


GCC 


CAA 


ser his pro 


asn 


ile 


val 


gly 


val 


lys 


asp 


ala lys ala 


asp 


leu 


his 


ser 


gly 


ala 


gin 


1381/461 
















1411/471 
















ATC ATG GCC 


GAC 


ACC 


GGA 


CTG 


GCC 


TAC 


TAT 


TCC GGC GAC 


GAC 


GCG 


CTC 


AAC 


CTG 


CCC 


TGG 


ile met ala 


asp 


thr 


gly 


leu 


ala 


tyr 


tyr 


ser gly asp 


asp 


ala 


leu 


asn 


leu 


pro 


trp 


1441/481 
















1471/491 
















CTG GCC ATG 


GGC 


GCC 


ACG 


GGC 


TTC 


ATC 


AGC 


GTG ATT GCC 


CAC 


CTG 


GCA 


GCC 


GGG 


CAG 


CTT 


leu ala met 


gly 


ala 


thr 


gly 


phe 


ile 


ser 


val ile ala 


his 


leu 


ala 


ala 


gly 


gin 


leu 


1501/501 
















1531/511 
















CGA GAG TTG 


TTG 


TCC 


GCC 


TTC 


GGT 


TCT 


GGG 


GAT ATC GCC 


ACC 


GCC 


CGC 


AAG 


ATC 


AAC 


ATT 


arg glu leu 


leu 


ser 


ala 


phe 


gly 


ser 


gly 


asp ile ala 


thr 


ala 


arg 


lys 


ile 


asn 


ile 


1561/521 
















1591/531 
















GCG GTC GCC 


CCG 


CTG 


TGC 


AAC 


GCG 


ATG 


AGC 


CGC CTG GGT 


GGG 


GTG 


ACG 


TTG 


TCC 


AAG 


GCG 


ala val ala 


pro 


leu 


cys 


asn 


ala 


met 


ser 


arg leu gly 


gly 


val 


thr 


leu 


ser 


lys 


ala 


1621/541 
















1651/551 
















GGC TTG CGG 


CTG 


CAG 


GGC 


ATC 


GAC 


GTC 


GGT 


GAT CCC CGG 


CTG 


CCC 


CAG 


GTG 


GCC 


GCG 


ACA 


gly leu arg 


leu 


gin 


gly 


ile 


asp 


val 


gly 


asp pro arg 


leu 


pro 


gin 


val 


ala 


ala 


thr 


1681/561 
















1711/571 
















CCG GAG CAG 


ATC 


GAC 


GCG 


TTG 


GCC 


GCC 


GAC 


ATG CGC GCG 


GCC 


TCG 


GTG 


CTT 


CGG 


TGA 




pro glu gin 


ile 


asp 


ala 


leu 


ala 


ala 


asp 


met arg ala 


ala 


ser 


val 


leu 


arg)OPA 





SEQ XD NOS: 631-632 (continued 1) 



FIG. 37E (continued 1) 




1/1_^-SEQ ID NOs 633 

GCG GTG AAC TGG TGG GCC CGG 
( ala va l asn trp trp ala arg 

61/21^^ SEQ ID NOs 634 

AGA CGA CGG ATG GAA GGA GAT 
arg arg arg met glu gly asp 
121/41 

TCG TCG TCT ACG GAG GTG AAG 
ser ser ser thr glu val lys 
181/61 

ACT GGC CCG CAG GTG AAG GCG 
thr gly pro gin val lys ala 
241/81 

GAC GCG TGC CCA GAA CAG GCC 
asp ala cys pro glu gin ala 
301/101 

CTT GTT GGC ATT GCG GCG ACG 
leu val gly ile ala ala thr 
361/121 

TTT GCG TTG CGC TCA CAC CAG 
phe ala leu arg ser his gin 
421/141 

GAG GCC GCT AAG GAT TGC GTT 
glu ala ala lys asp cys val 
481/161 

AGC ATG CAG AAG ATC 
ser met gin lys ile) 



31/11 

ATG GTT CAA GTA CGC CGT 
met val gin val arg arg 
91/31 

GCT GGC GCC GGC CAG CTG 
ala gly ala gly gin leu 
151/51 

GCG GCG GAT TCG GCG GAA 
ala ala asp ser ala glu 

211/71 
GCG GAT TCG GCG GAA TCT 
ala asp ser ala glu ser 

271/91 
CTC GTC GAG CGG CGC CCG 
leu val glu arg arg pro 

331/111 
CTG CTC GCG TTG GCC GGT 
leu leu ala leu ala gly 

391/131 
GAA AGC CAA TCA ATC GCG 
glu ser gin ser ile ala 

451/151 
GCG GCC ACG CAG GCA CCC 
ala ala thr gin ala pro 



CGC AAA CTC GAG CAC AAC AGG 
arg lys leu glu his asn arg 

AAC CCT GCC GAT GCG AAT AAG 
asn pro ala asp ala asn lys 

TCT GAC GCC GGA GCC GAC CAG 
ser asp ala gly ala asp gin 

GAC GCC GGA GAG CTC GGC GAG 
asp ala gly glu leu gly glu 

TCG CGG TTG CGG CGA GGC TGG 
ser arg leu arg arg gly trp 

GGC CTT GGC GCA GCG GGT TAT 
gly leu gly ala ala gly tyr 

CGC GAG GAC CTT GCG GCC ATT 
arg glu asp leu ala ala ile 

GAT GCT GGG GCG ATG TCG GCT 
asp ala gly ala met ser ala 



SEQ XD NOS: 633-634 



FIG. 38A 



1/1 ^ — 


SEQ 


ID NO: 635 










CAG CGG 


TGA 


ACT 


GGT 


GGG 


CCC 


GGA 


TGG 


TTC 


gin arg 


OPA ( thr 


gly 


gly pro 


gly 


trp 


phe 


61/21 




SEQ ID NO: 


636 






GGA GAC 


GAC 


GGA 


TGG 


AAG 


GAG 


ATG 


CTG 


GCG 


gly asp 


asp 


gly 


trp 


lys 


glu 


met 


leu 


ala 


121/41 


















AGT CGT 


CGT 


CTA 


CGG 


AGG 


TGA 


AGG 


CGG 


CGG 


ser arg 


arg 


leu 


arg 


arg) OPA (arg 


arg 


arg 


181/61 


SEQ ID NO: 


638—^ 








AGA CTG 


GCC 


CGC 


AGG 


TGA 


AGG 


CGG 


CGG 


ATT 


arg leu 


ala 


arg 


arg) OPA (arg 


arg 


arg 


ile 


241/81 


SEQ ID 


NO: 


639/ 










AGG ACG 


CGT 


GCC 


CAG 


AAC 


AGG 


CCC 


TCG 


TCG 


arg thr 


arg 


ala 


gin 


asn 


arg 


pro 


ser 


ser 


301/101 


















GGC TTG 


TTG 


GCA 


TTG 


CGG 


CGA 


CGC 


TGC 


TCG 


gly leu 


leu 


ala 


leu 


arg 


arg 


arg 


cys 


ser 


361/121 


















ATT TTG 


CGT 


TGC 


GCT 


CAC 


ACC 


AGG 


AAA 


GCC 


ile leu 


arg 


cys 


ala 


his 


thr 


arg 


lys 


ala 


421/141 


















TTG AGG 


CCG 


CTA 


AGG 


ATT 


GCG 


TTG 


CGG 


CCA 


leu arg 


pro 


leu 


arg 


ile 


ala 


leu 


arg 


pro 


481/161 


















CTA GCA 


TGC 


AGA 


AGA 


TC 










leu ala 


cys 


arg 


arg) 













31/11 

AAG TAC GCC GTC GCA AAC TCG AGC ACA ACA 

lys tyr ala val ala asn ser ser thr thr 
91/31 

CCG GCC AGC TGA ACC CTG CCG ATG CGA ATA 

pro ala ser) OPA (thr leu pro met arg ile 

151/51 C— SEQ ID NO: 637 

ATT CGG CGG AAT CTG ACG CCG GAG CCG ACC 

ile arg arg asn leu thr pro glu pro thr 
211/71 

CGG CGG AAT CTG ACG CCG GAG AGC TCG GCG 

arg arg asn leu thr pro glu ser ser ala 
271/91 

AGC GGC GCC CGT CGC GGT TGC GGC GAG GCT 

ser gly ala arg arg gly cys gly glu ala 
331/111 

CGT TGG CCG GTG GCC TTG GCG CAG CGG GTT 

arg trp pro val ala leu ala gin arg val 
391/131 

AAT CAA TCG CGC GCG AGG ACC TTG CGG CCA 

asn gin ser arg ala arg thr leu arg pro 
451/151 

CGC AGG CAC CCG ATG CTG GGG CGA TGT CGG 

arg arg his pro met leu gly arg cys arg 



SEQ XD NOS: 635-639 



FIG. 38B 



1/1^ — SEQ ID NO: 640 










31/11 


















AGC GGT 


GAA 


CTG 


GTG 


GGC 


CCG 


GAT 


GGT 


TCA 


AGT ACG 


CCG 


TCG 


CAA 


ACT 


CGA 


GCA 


CAA 


CAG 


(ser gly 


glu 


leu 


val 


gly 


pro 


asp 


crlv 


ser 


ser thr 


pro 


ser 


gin 


thr 


arg 


ala 


gin 


gin 


61721 " 


SEQ 


ID NO: 641 










91/31 


















GAG ACG 


ACG 


GAT 






AHA 










GCT 


GAA 


CCC 


TGC 


CGA 


TGC 


GAA 


TAA 


glu thr 


thr 


asp 


gly 


arg 


arg 


cys 




cxi. y 


dx. y -i- 


ala 


glu 


pro 


cys 


arg 


cys 


glu)OCH 


121/41 


















i m / ^1 

lDi/ 3 -L 


















GTC GTC 


GTC 


TAC 




GGT 










1 1L tj^jVw 


GGA 


ATC 


TGA 


CGC 


CGG 


AGC 


CGA 


CCA 


(val val 


val 


tyr 


gly 




glu 


gly 


gly 


gly 


pne yiy 


gly 


ile)OPA(arg 


arg 


ser 


arg 


pro 


181/61 SEQ ID 


NO: 


642 


















^SEQ ID NO: 


r 643 


GAC TGG 


CCC 


GCA 


GGT 


GAA 












ATC 


TGA 


CGC 


CGG 


AGA 


GCT 


CGG 


CGA 


asp trp 


pro 


ala 


gly 


glu 


gly 


gly 


gly 


phe 


gly gly 


ile)OPA(arg 


arg 


arg 


ala 


arg 


arg 


241/81 


















271/91 






V SEQ ID 


NO: 


644 






GGA CGC 


GTG 


CCC 


AGA 


ACA 


GGC 


CCT 


CGT 


CGA 


GCG GCG 


CCC 


GTC 


GCG 


GTT 


GCG 


GCG 


AGG 


CTG 


gly arg 


val 


pro 


arg 


thr 


gly 


pro 


arg 


arg 


ala ala 


pro 


val 


ala 


val 


ala 


ala 


arg 


leu 


301/101 


















331/111 


















GCT TGT 


TGG 


CAT 


TGC 


GGC 


GAC 


GCT 


GCT 


CGC 


GTT GGC 


CGG 


TGG 


CCT 


TGG 


CGC 


AGC 


GGG 


TTA 


ala cys 


trp 


his 


cys 


gly 


asp 


ala 


ala 


arg 


val gly 


arg 


trp 


pro 


trp 


arg 


ser 


gly 


leu 


361/121 


















391/131 


















TTT TGC 


GTT 


GCG 


CTC 


ACA 


CCA 


GGA 


AAG 


CCA 


ATC AAT 


CGC 


GCG 


CGA 


GGA 


CCT 


TGC 


GGC 


CAT 


phe cys 


val 


ala 


leu 


thr 


pro 


gly 


lys 


pro 


ile asn 


arg 


ala 


arg 


gly 


pro 


cys 


gly 


his) 


421/141 


















451/151 


















TGA GGC 


CGC 


TAA 


GGA 


TTG 


CGT 


TGC 


GGC 


CAC 


GCA GGC 


ACC 


CGA 


TGC 


TGG 


GGC 


GAT 


GTC 


GGC 


OPA gly 


arg 


OCH(gly 


leu 


arg 


cys 


gly 


his 


ala gly 


thr 


arg 


cys 


trp 


gly 


asp 


val 


gly) 



481/161 ^-SEQ ID NO: 645 

TAG CAT GCA GAA GAT C 
AMB(his ala glu asp) 
V SEQ ID NO: 646 



SEQ XD NOS: 640-646 



FIG. 38C 



Sequence Rv0175 predicted by Cole et al . , 1998 {Nature 393 537-544) 
and containing seq3 8A 



1/1^ SEQ ID NO: 


: 647 










ji/il 


















GTG AAG 


GCG 


GCG 






GCG 


GAA 


TCT 


GAC 


GCC GGA 


GCC 


GAC 


CAG 


ACT 


GGC 


CCG 


CAG 


GTG 


(vallys 


ala 


ala 


asp 


ser 


aia 


gjLu 


ser 


asp 


ala gly 


ala 


asp 


gm 


thr 


gly 


pro 


gin 


val 


61/21 


■—SEQ ID 


NO: 












91/31 


















AAG GCG 


GCG 


GAT 


TCG 


GCG 


GAA 


TCT 


GAC 


GCC 


GGA GAG 


CTC 


GGC 


GAG 


GAC 


GCG 


TGC 


CCA 


GAA 


lys ala 


ala 


asp 


ser 


ala 


glu 


ser 


asp 


ala 


gly glu 


leu 


gly 


glu 


asp 


ala 


cys 


pro 


glu 


121/41 


















151/51 


















CAG GCC 


CTC 


GTC 


GAG 


CGG 


CGC 


CCG 


TCG 


CGG 


TTG CGG 


CGA 


GGC 


TGG 


CTT 


GTT 


GGC 


ATT 


GCG 


gin ala 


leu 


val 


glu 


arg 


arg 


pro 


ser 


arg 


leu arg 


arg 


gly 


trp 


leu 


val 


giy 


ile 


ala 


181/61 


















211/71 


















GCG ACG 


CTG 


CTC 


GCG 


TTG 


GCC 


GGT 


GGC 


CTT 


GGC GCA 


GCG 


GGT" 


TAT 


TTT 


GCG 


TTG 


CGC 


TCA 


ala thr 


leu 


leu 


ala 


leu 


ala 


gly 


gly 


leu 


gly ala 


ala 


gly 


tyr 


phe 


ala 


leu 


arg 


ser 


241/81 


















271/91 


















CAC CAG 


GAA 


AGC 


CAA 


TCA 


ATC 


GCG 


CGC 


GAG 


GAC CTT 


GCG 


GCC 


ATT 


GAG 


GCC 


GCT 


AAG 


GAT 


his gin 


glu 


ser 


gin 


ser 


ile 


ala 


arg 


glu 


asp leu 


ala 


ala 


ile 


glu 


ala 


ala 


lys 


asp 


301/101 


















331/111 


















TGC GTT 


GCG 


GCC 


ACG 


CAG 


GCA 


CCC 


GAT 


GCT 


GGG GCG 


ATG 


TCG 


GCT 


AGC 


ATG 


CAG 


AAG 


ATC 


cys val 


ala 


ala 


thr 


gin 


ala 


pro 


asp 


ala 


gly ala 


met 


ser 


ala 


ser 


met 


gin 


lys 


ile 


361/121 


















391/131 


















ATC GAG 


TGT 


GGC 


ALL 


f~y /-* m 


GAT 


TTC 


GGT 


GCC 


CAG GCG 


TCG 


TTG 


TAC 


ACC 


AGC 


ATG 


CTC 


GTC 


ile glu 


cys 


gly 


thr 


gly 


asp 


phe 


giy 


ala 


gin ala 


ser 


leu 


tyr 


tnr 


ser 


met 


leu 


val 


421/141 


















451/151 


















GAG GCG 


TAT 


CAA 


GCG 


GCC 


AGC 


GTC 


CAC 


GTG 


CAA GTG 


ACC 


GAT 


ATG 


CGC 


GCG 


GCG 


GTC 


GAG 


glu ala 


tyr 


gin 


ala 


ala 


ser 


val 


his 


val 


gin val 


thr 


asp 


met 


arg 


ala 


ala 


val 


glu 


481/161 


















511/171 


















CGC AAC 


AAC 


AAT 


GAC 


GGG 


TCG 


GTC 


GAT 


GTT 


CTG GTG 


GCG 


CTC 


CGG 


GTC 


AAG 


GTG 


TCC 


AAC 


arg asn 


asn 


asn 


asp 


gly 


ser 


val 


asp 


val 


leu val 


ala 


leu 


arg 


val 


lys 


val 


ser 


asn 


541/181 


















571/191 


















ACC GAC 


TCG 


GAT 


GCC 


CAT 


GAA 


GTC 


GGC 


TAC 


CGT CTT 


CGG 


GTC 


CGG 


ATG 


GCA 


CTG 


GAT 


GAG 


thr asp 


ser 


asp 


ala 


his 


glu 


val 


giy 


tyr 


arg leu 


arg 


val 


arg 


met 


ala 


leu 


asp 


glu 


601/201 


















631/211 


















GGC CGC 


TAT 


AAG 


ATC 


GCC 


AAA 


CTC 


GAC 


CAG 


GTG ACG 


AAG 


TGA 














gly arg 


tyr 


lys 


ile 


ala 


lys 


leu 


asp 


gin 


val thr 


lys)OPA 















SEQ ID NOS: 647-648 



FIG. 38D 




ORF according to Cole et al . , 1998 (Nature 393 537-544) containing Rv0175 



1/1-*- SEQ ID NO: 649 














31/11 












TGA ACT 


GGT 


GGG 


GCC 


GGA 


TGG 


TGT 


CAA 


GTA 


CGC CGT 


CGC 


AAA 


CTC 


GAG 


CAC 


AAC 


AGG 


AGA 


OPA ( thr 


gly 


gly 


ala 


gly 


trp 


cys 


gin 


val 


arg arg 


arg 


lys 


leu 


glu 


his 


asn 


arg 


arg 


61/21*^ 


— SEQ ID 


NO: 


650 














91/31 












CGA CGG 


ATG 


GAA 


GGA 


GAT 


GCT 


GGC 


GCC 


GGC 


CAG CTG 


AAC 


CCT 


GCC 


GAT 


GCG 


AAT 


AAG 


TCG 


arg arg 


met 


glu 


gly 


asp 


ala 


gly 


ala 


gly 


gin leu 


asn 


pro 


ala 


asp 


ala 


asn 


lys 


ser 


121/41 


















151/51 


















TCG TCT 


ACG 


GAG 


GTG 


AAG 


GCG 


GCG 


GAT 


TCG 


GCG GAA 


TCT 


GAC 


GCC 


GGA 


GCC 


GAC 


CAG 


ACT 


ser ser 


thr 


glu 


val 


lys 


ala 


ala 


asp 


ser 


ala glu 


ser 


asp 


ala 


gly 


ala 


asp 


gin 


thr 


181/61 


















211/71 


















GGC CCG 


CAG 


GTG 


AAG 


GCG 


GCG 


GAT 


TCG 


GCG 


GAA TCT 


GAC 


GCC 


GGA 


GAG 


CTC 


GGC 


GAG 


GAC 


gly pro 


gin 


val 


lys 


ala 


ala 


asp 


ser 


ala 


glu ser 


asp 


ala 


gly 


glu 


leu 


gly 


glu 


asp 


241/81 


















271/91 


















GCG TGC 


CCA 


GAA 


CAG 


GCC 


CTC 


GTC 


GAG 


CGG 


CGC CCG 


TCG 


CGG 


TTG 


CGG 


CGA 


GGC 


TGG 


CTT 


ala cys 


pro 


glu 


gin 


ala 


leu 


val 


glu 


arg 


arg pro 


ser 


arg 


leu 


arg 


arg 


gly 


trp 


leu 


301/101 


















331/111 


















GTT GGC 


ATT 


GCG 


GCG 


ACG 


CTG 


CTC 


GCG 


TTG 


GCC GGT 


GGC 


CTT 


GGC 


GCA 


GCG 


GGT 


TAT 


TTT 


val gly 


ile 


ala 


ala 


thr 


leu 


leu 


ala 


leu 


ala gly 


gly 


leu 


gly 


ala 


ala 


gly 


tyr 


phe 


361/121 


















391/131 


















GCG TTG 


CGC 


TCA 


CAC 


CAG 


GAA 


AGC 


CAA 


TCA 


ATC GCG 


CGC 


GAG 


GAC 


CTT 


GCG 


GCC 


ATT 


GAG 


ala leu 


arg 


ser 


his 


gin 


glu 


ser 


gin 


ser 


ile ala 


arg 


glu 


asp 


leu 


ala 


ala 


ile 


glu 


421/141 


















451/151 


















GCC GCT 


AAG 


GAT 


TGC 


GTT 


GCG 


GCC 


ACG 


CAG 


GCA CCC 


GAT 


GCT 


GGG 


GCG 


ATG 


TCG 


GCT 


AGC 


ala ala 


lys 


asp 


cys 


val 


ala 


ala 


thr 


gin 


ala pro 


asp 


ala 


gly 


ala 


met 


ser 


ala 


ser 


481/161 


















511/171 


















ATG CAG 


AAG 


ATC 


ATC 


GAG 


TGT 


GGC 


ACC 


GGT 


GAT TTC 


GGT 


GCC 


CAG 


GCG 


TCG 


TTG 


TAC 


ACC 


met gin 


lys 


ile 


ile 


glu 


cys 


gly 


thr 


gly 


asp phe 


gly 


ala 


gin 


ala 


ser 


leu 


tyr 


thr 


541/181 


















571/191 


















AGC ATG 


CTC 


GTC 


GAG 


GCG 


TAT 


CAA 


GCG 


GCC 


AGC GTC 


CAC 


GTG 


CAA 


GTG 


ACC 


GAT 


ATG 


CGC 


ser met 


leu 


val 


glu 


ala 


tyr 


gin 


ala 


ala 


ser val 


his 


val 


gin 


val 


thr 


asp 


met 


arg 


601/201 


















631/211 


















GCG GCG 


GTC 


GAG 


CGC 


AAC 


AAC 


AAT 


GAC 


GGG 


TCG GTC 


GAT 


GTT 


CTG 


GTG 


GCG 


CTC 


CGG 


GTC 


ala ala 


val 


glu 


arg 


asn 


asn 


asn 


asp 


gly 


ser val 


asp 


val 


leu 


val 


ala 


leu 


arg 


val 


661/221 


















691/231 


















AAG GTG 


TCC 


AAC 


ACC 


GAC 


TCG 


GAT 


GCC 


CAT 


GAA GTC 


GGC 


TAC 


CGT 


CTT 


CGG 


GTC 


CGG 


ATG 


lys val 


ser 


asn 


thr 


asp 


ser 


asp 


ala 


his 


glu val 


gly 


tyr 


arg 


leu 


arg 


val 


arg 


met 


721/241 


















751/251 


















GCA CTG 


GAT 


GAG 


GGC 


CGC 


TAT 


AAG 


ATC 


GCC 


AAA CTC 


GAC 


CAG 


GTG 


ACG 


AAG 


TGA 






ala leu 


asp 


glu 


gly 


arg 


tyr 


lys 


ile 


ala 


lys leu 


asp 


gin 


val 


thr 


lys) OPA 







SEQ ID NOS: 649-650 



FIG. 38E 




ACA CCT CCC CCC CCG CCG 
(thrpro pro pro pro pro 
61/21 " — SEQ ID NO: 652 

GGC GTG ATG CAA GGC TGC 
gly val met gin gly cys 
121/41 

ACC GCA CTG GTC GCC GAG 
thr ala leu val ala glu 



31/11 

CCG CTG CCG CCG GTT CCC TTT 
pro leu pro pro val pro phe 

CTT GAG AGC ACC AGC GGC TTG 
leu glu ser thr ser gly leu 
151/51 

CGC ATC ACC GGT GCC GTC GAG 
arg ile thr gly ala val glu 



CCC AAG GAA TGT CCG GCG CCG 
pro lys glu cys pro ala pro 
91/31 

ATC ATG GGC ATC GAC AGC AAG 
ile met gly ile asp ser lys 

GAG ATC 
glu ile) 



SEQ ID NOS: 651-652 

FIG. 39A 



1/l^SEQ ID NO: 653 

CAC CTC CCC CCC CGC CGC CGC TGC CGC CGG 
( his le u pro pro arg arg arg cys arg arg 
61/21 SEQ ID NO: 654 

GCG TGA TGC AAG GCT GCC TTG AGA GCA CCA 

ala)OPA(cys lys ala ala leu arg ala pro 
121/41 ^-SEQ ID NO: 655 

CCG CAC TGG TCG CCG AGC GCA TCA CCG GTG 

pro his trp ser pro ser ala ser pro val 



31/11 

TTC CCT TTC CCA AGG AAT GTC CGG CGC CGG 
phe pro phe pro arg asn val arg arg arg 
91/31 

GCG GCT TGA TCA TGG GCA TCG ACA GCA AGA 
ala ala)OPA(ser trp ala ser thr ala arg 
151/51 v — SEQ ID NO: 656 
CCG TCG AGG AGA TC 
pro ser arg arg) 



SEQ ID NOS: 653-656 

FIG. 39B 



r/l^SEQ ID NO: 657 

GAC ACC TCC CCC CCC GCC 
(ajspthr ser pro pro ala 
61/21^SEQ ID NO: 658 
GGG CGT GAT GCA AGG CTG 
gly arg asp ala arg leu 
121/41 SEQ NO: 

GAC CGC ACT GGT CGC CGA 
asp arg thr gly arg arg 



31/11 

GCC GCT GCC GCC GGT TCC CTT 
ala ala ala ala gly ser leu 
91/31 

CCT TGA GAG CAC CAG CGG CTT 
pro)OPA(glu his gin arg leu 

659 * 151/51 

GCG CAT CAC CGG TGC CGT CGA 
ala his his arg cys arg arg 



TCC CAA GGA ATG TCC GGC GCC 
ser gin gly met ser gly ala 

GAT CAT GGG CAT CGA CAG CAA 
asp his gly his arg gin gin 

GGA GAT C 
gly asp) 



SEQ ID NOS: 657-659 



FIG. 39C 




Coding sequence Rv3006 predicted by Cole et al, ( 1998 (Nature 393 537-544) and 
containing seq39A 



1 /1 

- " 


SEQ 


ID NO: 660 










■si /i i 

O 1 / 1 1 


















ATG TGG 


ACA 


ACG 


CGG 


TTG 


GTT 


CGA 


TCC 


GGA 


pmp PPP 




CTG 


TGC 


GCG 


GCA 


GTG 


CTG 


GTA 


lifl.ec crp 


thr 


thr 


arg 


leu 


val 


arg 


ser 


giy 


leu ala 


ala 

d la 


leu 


cys 


ala 


ala 


Val 


leu 


Val 




^SEQ ID 


NO t 


DDI 










J X. / O X 


















TCG ACjC 


cnc 




pp jv 


ppp 


iihup 
1 1L 


x t\p 
AAL- 




pprp 


p A A TtPT 1 


\-A\j 






TiPP 


ACC 


GAA 


CCG 


GAG 


ser ser 


gly 


cys 


aia 


arg 


phe 


asn 


asp 


aid 


gin bci 


gin 


pro 


phe 




enr 


glu 


pro 


glu 


1Z 1 / 4 1 


















1 CI / CI- 

131/ D 1 


















prnp PPP 


L,L-L- 


7A A 
L-AA 


ppp 


AvjL, 


rppp 


AL-A 






L»L,L, L,L»G 




ppp 


pmp 


ppp 


ppp 

t-t-G 


GTT 


CCC 


TTT 


leu arg 


pro 


gin 


pro 


ser 


ser 




pro 


pro 


pro pxtj 


pro 


pro 


leu 


pro 


pro 


val 


pro 


phe 


lol/ Ox 


















Oil /T1 
zli/ /I 


















LLL AAG 


GAA 


TGT 


ppp 

L-L,G 


ppp 


ppp 


ppp 


(""TV™* 
Lj 1 Vj 




pTV TV pPP 






pip 


app 


APP 

ALL 


APP 


ppp 


TTG 


pro lys 


glu 


cys 


pro 


aia 


pro 


giy 


Veil 


me t 


y xii y ±y 


cys 


leu 


glu 


ser 


t-IlX 


ser 


giy 


leu 


/41/ol 


















Z / 1 / " 1 


















ATC ATG 


GGC 


ATC 


GAC 


AGL- 


AAG 


ACC 


GCA 


CTG 


pmp /— >/~i/~« 


tjAvj 




ATC 


ACC 


GGT 


GCC 


GTC 


GAG 


ile met 


giy 


ile 


asp 


ser 


lys 


tnr 


aia 


leu 


va 1 ala 


glu 


arg 


ne 


tnr 


giy 


aia 


val 


glu 


JU1 / 1U1 


















T51 /111 
Jjl/ 111 


















GAG ATC 


TCT 


ATC 


AGC 


GCC 


GAG 


CCG 


AAG 


GTA 


AAG ACG 


GTC 


ATC 


CCC 


GTG 


GAT 


CCT 


GCC 


GGT 


glu ile 


ser 


ile 


ser 


ala 


giu 


pro 


lys 


vai 


lys thr 


ITS* 1 

vai 


ne 


pro 


vai 


asp 


pro 


ala 


giy 


361/121 


















jy 1/ i ji 


















GAC GGT 


GGC 


TTG 


ATG 


GAC 


ATT 


GTG 


CTG 


TCG 


CCC ACC 


TAC 


TCG 


CAA 


GAC 


CGG 


CTG 


ATG 


TAC 


asp gly 


gly 


leu 


met 


asp 


ne 


vai 


leu 


ser 


pro thr 


tyr 


ser 


gin 


asp 


arg 


leu 


met 


tyr 


421/141 


















4 Dl / 1 D 1 


















GCC TAC 


ATC 


AGC 


ACG 


CCC 


ACC 


GAC 


AAC 


CGG 


GTG GTG 


CGA 


GTG 


GCC 


GAC 


GGC 


GAC 


ATC 


CCC 


ala tyr 


ile 


ser 


thr 


pro 


thr 


asp 


asn 


arg 


va 1 va 1 


arg 


val 


aia 


asp 


giy 


asp 


ile 


pro 


481/161 


















511/171 


















AAG GAC 


ATC 


CTG 


ACC 


GGC 


ATC 


CCC 


AAA 


GGT 


GCT GCC 


GGT 


AAC 


ACC 


GGG 


GCG 


CTG 


ATC 


TTC 


lys asp 


ile 


leu 


thr 


gly 


ile 


pro 


lys 


giy 


ala ala 


gly 


asn 


4_ Vl -K- 


gly 


aia 


leu 


lie 


phe 


541/181 


















d / 1 / iy i 


















ACC AGT 


CCC 


ACC 


ACG 


CTG 


GTC 


GTG 


ATG 


ACC 


GGG GAT 


GCT 


GGC 


GAC 


CCG 


GCG 


TTG 


GCC 


GCC 


thr ser 


pro 


cnr 


tnr 


leu 


vai 


■*ra 1 

vai 


met 


enr 


gly asp 


ala 


giy 


asp 


pro 


aid 


leu 


aia 


ala 

aia 




















£•31 /Oil 
Dj1/Z11 


















GAT CCC 


p A A 


rnpp 
1 v_G 


TTG 


GCC 


GGT 


AAG 


GTC 


CTG 


ppm t\ rn/~» 


GAA 


v_ALt 




7\ pp 


APP 


ATC 


ppp 

<jLtL. 


PAP 


asp pro 


gin 


ser 


leu 


a 1 a 

aia 


giy 


lys 


Veil 


leu 


arg ile 


glu 


gin 


pro 


(-V, r 


thr 


ile 


giy 


gin 


DDI/ ^Zl 


















fiQ 1 /9 1 

D31/ 6 Jl 


















ACG CCG 


v_ Vj» 


ACG 


a pp 


ppp 




1a^ 1 




imp 






\J\J\-. 






TCC 


ATp 
A X L. 


PAT 1 
X 


CCC 
LLu 


thr pro 


P^o 




tnr 


c± J-C± 


leu 


ser 


giy 


ile 


gly ser 


gly 


gly 


y x jt 


1 eu 


cys 


ile 


asp 


pro 


/Zl/ All 


















751/251 


















P T*/*^ PAP 


GGC 


TCG 




TAT" 1 


VJ 1 v_. 


ace 


GAC 


CGC 




ACG 


GCG 


GAC 


CGA 


TTG 


CAG 


CGC 


ATC 


Val asp 


gly 


ser 


leu 


tyr 




ala 


asp 


arg 


thr pro 


thr 


ala 


asp 


arg 


leu 


gin 


arg 


ile 


781/261 


















811/271 


















ACC AAG 


AAC 


TCG 


pap 


\3 1 \~ 


TCT 


ACG 


GTA 


TGG 


ACC TGG 


CCG 


GAC 


AAG 


CCC 


GGC 


GTG 


GCC 


GGG 


thr lys 


asn 


ser 


glu 


val 


o o v 

BC1 


thr 


val 


l-x-1-> 


thr trp 


pro 


asp 




pro 




val 


ala 


alv 
y -*- jt 


841/281 


















871/291 


















TGT GCC 


GCG 


ATG 


GAC 


GGC 


ACC 


GTG 


CTG 


GTC 


AAC CTG 


ATT 


AAT 


ACC 


AAA 


CTG 


ACG 


GTG 


GCG 


cys ala 


ala 


met 


asp 


gly 


thr 


val 


leu 


val 


asn leu 


ile 


asn 


thr 


lys 


leu 


thr 


val 


ala 


901/301 


















931/311 


















GTC CGG 


CTC 


GCG 


CCG 


TCG 


ACC 


GGT 


GCG 


GTC 


ACC GGA 


GAA 


CCC 


GAC 


GTT 


GTC 


CGC 


AAA 


GAC 


val arg 


leu 


ala 


pro 


ser 


thr 


giy 


ala 


val 


thr gly 


glu 


pro 


asp 


val 


val 


arg 


lys 


asp 


961/321 


















991/331 


















ACT CAT 


GCG 


CAT 


GCG 


TGG 


GCA 


TTA 


CGG 


ATG 


TCG CCG 


GAC 


GGC 


AAC 


GTC 


TGG 


GGA 


GCC 


ACC 


thr his 


ala 


his 


ala 


trp 


ala 


leu 


arg 


met 


ser pro 


asp 


gly 


asn 


val 


trp 


giy 


ala 


thr 


1021/341 
















1051/351 
















GTC AAC 


AAG 


ACC 


GCC 


GGC 


GAC 


GCC 


GAG 


AAG 


CTC GAC 


GAT 


GTG 


GTG 


TTC 


CCG 


CTG 


TTC 


CCG 


val asn 


lys 


thr 


ala 


giy 


asp 


ala 


glu 


lys 


leu asp 


asp 


val 


val 


phe 


pro 


leu 


phe 


pro 


1081/361 
















1111/371 
















CAG GGT 


GGC 


GGC 


TTC 


CCG 


CGC 


AAC 


AAC 


GAC 


GAC AAG 


ACC 


TGA 














gin gly 


gly 


gly 


phe 


pro 


arg 


asn 


asn 


asp 


asp lys 


thr)OPA 















SEQ XD NOS: 660-661 



FIG. 39D 




ORF according to Cole et al . , 1998 (Nature 393 537-544) and containing Rv3006 



1/L__^SEQ ID NO 


: 662 










31/11 


















TAA GGC 


CAT 


TTA 


GTG 


CCG 


AAT 


TGG 


GGA 


TTT 


GAG CGG 


CGC 


TTT 


CGC 


CAG 


ACA 


ATC 


CGC 


ACA 


OCH(gly 


his 


leu 


val 


pro 


asn 


trp 


gly 


phe 


glu arg 


arg 


phe 


arg 


gin 


thr 


ile 


arg 


thr 


61/2i^~ 


-SEQ 


ZD NO: 663 










91/31 


















TTG ACC 


CTG 


ACC 


AGC 


CCA 


CCA 


AAA 


GGC 


CCC 


AAT TGG 


GCC 


GCC 


ATG 


CCG 


ACA 


GTG 


CGC 


ACC 


leu thr 


leu 


thr 


ser 


pro 


pro 


lys 


gly 


pro 


asn trp 


ala 


ala 


met 


pro 


thr 


val 


arg 


thr 


121/41 


















151/51 


















CCG GCA 


GGT 


GGC 


GGC 


GAT 


GCC 


CAC 


AAT 


GTC 


CGT AGC 


CTG 


TCG 


GTC 


ATG 


TGG 


ACA 


ACG 


CGG 


pro ala 


gly 


gly 


gly 


asp 


ala 


his 


asn 


val 


arg ser 


leu 


ser 


val 


met 


trp 


thr 


thr 


arg 


181/61 


















211/71 


















TTG GTT 


CGA 


TCC 


GGA 


CTC 


GCC 


GCG 


CTG 


TGC 


GCG GCA 


GTG 


CTG 


GTA 


TCG 


AGC 


GGC 


TGC 


GCA 


leu val 


arg 


ser 


gly 


leu 


ala 


ala 


leu 


cys 


ala ala 


val 


leu 


val 


ser 


ser 


gly 


cys 


ala 


241/81 


















271/91 


















CGG TTC 


AAC 


GAC 


GCT 


CAA 


TCT 


CAG 


CCG 


TTC 


ACC ACC 


GAA 


CCG 


GAG 


CTG 


CGG 


CCC 


CAA 


CCC 


arg phe 


asn 


asp 


ala 


gin 


ser 


gin 


pro 


phe 


thr thr 


glu 


pro 


glu 


leu 


arg 


pro 


gin 


pro 


301/101 


















331/111 


















AGC TCG 


ACA 


CCT 


CCC 


CCC 


CCG 


CCG 


CCG 


CTG 


CCG CCG 


GTT 


CCC 


TTT 


CCC 


AAG 


GAA 


TGT 


CCG 


ser ser 


thr 


pro 


pro 


pro 


pro 


pro 


pro 


leu 


pro pro 


val 


pro 


phe 


pro 


lys 


glu 


cys 


pro 


361/121 


















391/131 


















GCG CCG 


GGC 


GTG 


ATG 


CAA 


GGC 


TGC 


CTT 


GAG 


AGC ACC 


AGC 


GGC 


TTG 


ATC 


ATG 


GGC 


ATC 


GAC 


ala pro 


gly 


val 


met 


gin 


gly 


cys 


leu 


glu 


ser thr 


ser 


gly 


leu 


ile 


met 


gly 


ile 


asp 


421/141 


















451/151 


















AGC AAG 


ACC 


GCA 


CTG 


GTC 


GCC 


GAG 


CGC 


ATC 


ACC GGT 


GCC 


GTC 


GAG 


GAG 


ATC 


TCT 


ATC 


AGC 


ser lys 


thr 


ala 


leu 


val 


ala 


glu 


arg 


ile 


thr gly 


ala 


val 


glu 


glu 


ile 


ser 


ile 


ser 


481/161 


















511/171 


















GCC GAG 


CCG 


AAG 


GTA 


AAG 


ACG 


GTC 


ATC 


CCC 


GTG GAT 


CCT 


GCC 


GGT 


GAC 


GGT 


GGC 


TTG 


ATG 


ala glu 


pro 


lys 


val 


lys 


thr 


val 


ile 


pro 


val asp 


pro 


ala 


gly 


asp 


gly 


gly 


leu 


met 


541/181 


















571/191 


















GAC ATT 


GTG 


CTG 


TCG 


CCC 


ACC 


TAC 


TCG 


CAA 


GAC CGG 


CTG 


ATG 


TAC 


GCC 


TAC 


ATC 


AGC 


ACG 


asp ile 


val 


leu 


ser 


pro 


thr 


tyr 


ser 


gin 


asp arg 


leu 


met 


tyr 


ala 


tyr 


ile 


ser 


thr 


601/201 


















631/211 


















CCC ACC 


GAC 


AAC 


CGG 


GTG 


GTG 


CGA 


GTG 


GCC 


GAC GGC 


GAC 


ATC 


CCC 


AAG 


GAC 


ATC 


CTG 


ACC 


pro thr 


asp 


asn 


arg 


val 


val 


arg 


val 


ala 


asp gly 


asp 


ile 


pro 


lys 


asp 


ile 


leu 


thr 


661/221 


















691/231 


















GGC ATC 


CCC 


AAA 


GGT 


GCT 


GCC 


GGT 


AAC 


ACC 


GGG GCG 


CTG 


ATC 


TTC 


ACC 


AGT 


CCC 


ACC 


ACG 


gly ile 


pro 


lys 


gly 


ala 


ala 


gly 


asn 


thr 


gly ala 


leu 


ile 


phe 


thr 


ser 


pro 


thr 


thr 


/zl/z41 


















751/251 


















CTG GTC 


GTG 


ATG 


ACC 


GGG 


GAT 


GCT 


GGC 


GAC 


CCG GCG 


TTG 


GCC 


GCC 


GAT 


CCC 


CAA 


TCG 


TTG 


leu val 


val 


met 


thr 


gly 


asp 


ala 


gly 


asp 


pro ala 


leu 


ala 


ala 


asp 


pro 


gin 


ser 


leu 


781/261 


















811/271 


















GCC GGT 


AAG 


GTC 


CTG 


CGT 


ATC 


GAA 


CAG 


CCC 


ACC ACC 


ATC 


GGC 


CAG 


ACG 


CCG 


CCG 


ACG 


ACG 


ala gly 


lys 


val 


leu 


arg 


ile 


glu 


gin 


pro 


thr thr 


ile 


gly 


gin 


thr 


pro 


pro 


thr 


thr 


841/281 


















871/291 


















GCG CTG 


TCT 


GGC 


ATC 


GGC 


TCC 


GGC 


GGC 


GGC 


TTG TGC 


ATC 


GAT 


CCG 


GTC 


GAC 


GGC 


TCG 


CTA 


ala leu 


ser 


gly 


ile 


gly 


ser 


gly 


gly 


gly 


leu cys 


ile 


asp 


pro 


val 


asp 


gly 


ser 


leu 


901/301 


















931/311 


















TAT GTC 


GCC 


GAC 


CGC 


ACG 


CCA 


ACG 


GCG 


GAC 


CGA TTG 


CAG 


CGC 


ATC 


ACC 


AAG 


AAC 


TCG 


GAG 


tyr val 


ala 


asp 


arg 


thr 


pro 


thr 


ala 


asp 


arg leu 


gin 


arg 


ile 


thr 


lys 


asn 


ser 


glu 



SEQ XD NOS: 662-663 



FIG. 39E 



961/321 

GTC TCT ACG GTA TGG ACC TGG CCG GAC AAG 

val ser thr val trp thr trp pro asp lys 
1021/341 

GGC ACC GTG CTG GTC AAC CTG ATT AAT ACC 

gly thr val leu val asn leu ile asn thr 
1081/361 

TCG ACC GGT GCG GTC ACC GGA GAA CCC GAC 

ser thr gly ala val thr gly glu pro asp 
1141/381 

TGG GCA TTA CGG ATG TCG CCG GAC GGC AAC 

trp ala leu arg met ser pro asp gly asn 
1201/401 

GGC GAC GCC GAG AAG CTC GAC GAT GTG GTG 

gly asp ala glu lys leu asp asp val val 
1261/421 

CCG CGC AAC AAC GAC GAC AAG ACC TGA 
pro arg asn asn asp asp lys thr)OPA 



SEQ ID NOSs662 

FSGo 39E 



991/331 
















CCC GGC GTG 


GCC 


GGG 


TGT 


GCC 


GCG 


ATG 


GAC 


pro gly val 


ala 


gly 


cys 


ala 


ala 


met 


asp 


1051/351 
















AAA CTG ACG 


GTG 


GCG 


GTC 


CGG 


CTC 


GCG 


CCG 


lys leu thr 


val 


ala 


val 


arg 


leu 


ala 


pro 


1111/371 
















GTT GTC CGC 


AAA 


GAC 


ACT 


CAT 


GCG 


CAT 


GCG 


val val arg 


lys 


asp 


thr 


his 


ala 


his 


ala 


1171/391 
















GTC TGG GGA 


GCC 


ACC 


GTC 


AAC 


AAG 


ACC 


GCC 


val trp gly 


ala 


thr 


val 


asn 


lys 


thr 


ala 


1231/411 
















TTC CCG CTG 


TTC 


CCG 


CAG 


GGT 


GGC 


GGC 


TTC 


phe pro leu 


phe 


pro 


gin 


gly 


gly 


gly 


phe 



663 (continued) 
(continued) 



1 /1 ^ SEQ IP NO: 664 31/11 

GAA GGC CTT GTT GAG CCG GCG CAC GAA AAC GAT CGT TGT GTG TAC 

(g lu g ly leu val glu pro ala his glu asn asp arg cys val tyr 

61*721 SEQ ID NO: 665 91/31 

TCG GTT GAA CGT GTA TGT GCC CGA CGA ATT GGC GGA GCG CGC CAG 

ser val glu arg val cys ala arg arg ile gly gly ala arg gin 

121/41 151/51 

CGT CTC GGC GCT GAC TCA GGC CGC GAT CAG TGC CGA GTT GGA GAA 

arg leu gly ala asp ser gly arg asp gin cys arg val gly glu 

181/61 211/71 

GTG GCT TGA GGG GTT GGA ACC CAG AAG CAC CGG CGC TCG GCA TGA 

val ala)OPA(gly val gly thr gin lys his arg arg ser ala)OPA 

241/81 ^SEQ ID NO: 931 271/91 

GAT CGA TGC CGC TCG CGA TGA GTT CGA AGC GTG AGA GCA TCG CCC 

asp arg cys arg ser arg)OPA(val arg ser val arg ala ser pro 

301/101 SEQ ID NO: 933- — ^ 331/111 

CAG GTG GTC GTC GAC GCG AGT GCC ATG GTG GAT C 

gin val val val asp ala ser ala met val asp) 



ATT 


GGT 


GTG 


TAT 


GGC 


ile 


gly 


val 


tyr 


gly 


GGC 


GCG 


GGG 


CTT 


GAA 


gly 


ala 


gly 


leu 


glu 


CTC 


CGC 


AAC 


CGA 


TGC 


leu 


arg 


asn 


arg 


cys 


TGA 


CGT 


GCT 


GGG 


TGC 


OPA(arg 


ala 


gly 


cys 




^SEQ ID 


NO: 


932 


ACT 


TCG 


CCG 


CCG 


GAG 


thr 


ser 


pro 


pro 


glu 



SEQ ID NOSs664-665, 931-933 

FIG, 40A 



V 




1/1 SEQ ID NO: 


666 










31/11 


















AAG GCC 


TTG 


TTG 


AGC 


CGG 




ACG 


AAA 


ACG 


ATC GTT 


GTG 


TGT 


ACA 


TTG 


GTG 


TGT 


ATG 


GCT 


(lys ala 


leu 


leu 


ser 


arg 


arg 


thr 


lvs 


thr 


ile val 


val 


cys 


thr 


leu 


val 


cys 


met 


ala 


61/21 SEQ ID NO: 667 










91/31 


















CGG TTG 


AAC 


GTG 


TAT 


GTG 


l— V— v_ 


HAP 


GAA 


TTG 


nrci gag 


CGC 


GCC 


AGG 


GCG 


CGG 


GGC 


TTG 


AAC 


arg leu 


asn 


val 


tyr 


val 


pro 




glu 


leu 


ala glu 


arg 


ala 


arg 


ala 


arg 


crl v 


leu 


asn 


121/41 


















151/51 


















GTC TCG 


GCG 


CTG 


ACT 


CAG 


GCC 


GCG 


ATC 


AGT 


GCC GAG 


TTG 


GAG 


AAC 


TCC 


GCA 


ACC 


GAT 


GCG 


val ser 


ala 


leu 


thr 


gin 


ala 


ala 


ile 


ser 


ala glu 


leu 


glu 


asn 


ser 


ala 


thr 


asp 


ala 


181/61 


















211/71 


















TGG CTT 


GAG 


GGG 


TTG 


GAA 


ccc 


AGA 


AGC 


ACC 


GGC GCT 


CGG 


CAT 


GAT 


GAC 


GTG 


CTG 


GGT 


GCG 


trp leu 


glu 


gly 


leu 


glu 


pro 


arg 


ser 


thr 


gly ala 


arg 


his 


asp 


asp 


val 


leu 


gly 


ala 


241/81 


















271/91 


















ATC GAT 


GCC 


GCT 


CGC 


GAT 


GAG 


TTC 


GAA 


GCG 


TGA GAG 


CAT 


CGC 


CCA 


CTT 


CGC 


CGC 


CGG 


AGC 


ile asp 


ala 


ala 


arg 


asp 


glu 


phe 


glu 


ala)OPA(glu 


his 


arg 


pro 


leu 


arg 


arg 


arg 


ser 


301/101 


















331/111 


SEQ ID 


NO: 


668 










AGG TGG 


TCG 


TCG 


ACG 


CGA 


GTG 


CCA 


TGG 


TGG 


ATC 


















arg trp 


ser 


ser 


thr 


arg 


val 


pro 


trp 


trp 


ile) 



















SEQ ID NOSs €66-668 




1/1^-—-- SEQ ID NO: 669 

AGG CCT TGT TGA GCC GGC GCA CGA AAA 
arg pro cys OPA(ala gly ala arg lys 
61/21 SEQ ID NO: 670 

GGT TGA ACG TGT ATG TGC CCG ACG AAT 
gly)OPA(thr cys met cys pro thr asn 
121/41 ^SEQ ID NO: 671 
TCT CGG CGC TGA CTC AGG CCG CGA TCA 
ser arg arg)OPA(leu arg pro arg ser 
181/61 ^SEQ ID NO: 673 

GGC TTG AGG GGT TGG AAC CCA GAA GCA 
gly leu arg gly trp asn pro glu ala 
241/81 

TCG ATG CCG CTC GCG ATG AGT TCG AAG 
ser met pro leu ala met ser ser lys 
301/101 

GGT GGT CGT CGA CGC GAG TGC CAT GGT 
gly gly arg arg arg glu cys his gly 



31/11 

CGA TCG TTG TGT GTA CAT TGG TGT GTA TGG CTC 
arg ser leu cys val his trp cys val trp leu 
91/31 

TGG CGG AGC GCG CCA GGG CGC GGG GCT TGA ACG 
trp arg ser ala pro gly arg gly ala)OPA(thr 

151/51 SEQ ID NO: 672 —*" 

GTG CCG AGT TGG AGA ACT CCG CAA CCG ATG CGT 
val pro ser trp arg thr pro gin pro met arg 

211/71 

CCG GCG CTC GGC ATG ATG ACG TGC TGG GTG CGA 
pro ala leu gly met met thr cys trp val arg 
271/91 

CGT GAG AGC ATC GCC CAC TTC GCC GCC GGA GCA 
arg glu ser ile ala his phe ala ala gly ala 

GGA TC 

gly) 



SEQ ID NOSs 669-673 



FIG. 40C 



.J 1 



Coding sequence Rv0549c predicted by Cole et al . , 1998 (Nature 393:537-544) and 
containing seq40A 

1/1___— SEQ ID NO: 674 31/11 

gtg aga gca teg ccc act teg ccg ccg gag cag gtg gtc gtc gac gcg agt gec atg gtg 
(valarg ala ser pro thr ser pro pro glu gin val val val asp ala ser ala met val 
6T72T~^-SEQ ID NO: 675 91/31 

gat eta ctg get cgc act age gat egg tgc tct gcg gtg cgc gcg egg ctg get egg acc 
asp leu leu ala arg thr ser asp arg cys ser ala val arg ala arg leu ala arg thr 
121/41 151/51 

gcg atg cac gcg ccg gcg cac ttc gat gca gag gtg ttg teg gcg ctg ggg cgc atg cag 
ala met his ala pro ala his phe asp ala glu val leu ser ala leu gly arg met gin 
181/61 211/71 

cgc gee ggc gca etc acc gtt gee tat gtc gat gcg gca ctg gag gag ttg cga cag gtg 
arg ala gly ala leu thr val ala tyr val asp ala ala leu glu glu leu arg gin val 
241/81 271/91 

ccg gtg act cga cac ggt ctt teg teg ctg ctt get gga gcg tgg teg cgc cgc gac acc 
pro val thr arg his gly leu ser ser leu leu ala gly ala trp ser arg arg asp thr 
301/101 331/111 

etc cgc ctg acc gat gee etc tac gtc gag ctg gec gaa acg gca ggt ctg gtg ttg ttg 
leu arg leu thr asp ala leu tyr val glu leu ala glu thr ala gly leu val leu leu 
361/121 391/131 

acc acc gac gaa aga ttg gca cgc gee tgg ccc teg get cac gec ate ggc tga 
thr thr asp glu arg leu ala arg ala trp pro ser ala his ala ile gly)OPA 



SEQ ID KTOSs 674-675 

FIG. 40D 

ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv0549c 
1/l^.SEQ ID NO: 676 31/11 

tga gtt cga age gtg aga gca teg ccc act teg ccg ccg gag cag gtg gtc gtc gac gcg 
0PA(val arg ser val arg ala ser pro thr ser pro pro glu gin val val val asp ala 
61/2? 'SEQ ID NO: 677 91/31 

agt gec atg gtg gat eta ctg get cgc act age gat egg tgc tct gcg gtg cgc gcg egg 
ser ala met val asp leu leu ala arg thr ser asp arg cys ser ala val arg ala arg 
121/41 151/51 

ctg get egg acc gcg atg cac gcg ccg gcg cac ttc gat gca gag gtg ttg teg gcg ctg 
leu ala arg thr ala met his ala pro ala his phe asp ala glu val leu ser ala leu 
181/61 211/71 

ggg cgc atg cag cgc gec ggc gca etc acc gtt gec tat gtc gat gcg gca ctg gag gag 
gly arg met gin arg ala gly ala leu thr val ala tyr val asp ala ala leu glu glu 
241/81 271/91 

ttg cga cag gtg ccg gtg act cga cac ggt ctt teg teg ctg ctt get gga gcg tgg teg 
leu arg gin val pro val thr arg his gly leu ser ser leu leu ala gly ala trp ser 
301/101 331/111 

cgc cgc gac acc etc cgc ctg acc gat gee etc tac gtc gag ctg gec gaa acg gca ggt 
arg arg asp thr leu arg leu thr asp ala leu tyr val glu leu ala glu thr ala gly 
361/121 _ 391/131 

ctg gtg ttg "ttg acc acc gac gaa aga ttg gca cgc gee tgg ccc teg get cac gec ate 
leu val leu leu thr thr asp glu arg leu ala arg ala trp pro ser ala his ala ile 

42i/r4'i 

ggc tga > 
gly)OPA 

SEQ ID NOSs 676-677 



FIG. 40E 



SEQ ID NO:678 31/11 

CCT GGC CGG GAC GCC TAC GTG TAG CCC GCG GCT AGC AC A GGA TAG CCA TTG TTG TGC GGT 
(p ro g ly arg asp ala tyr val)AMB(pro ala ala ser thr gly) AMB (pro leu leu cys gly 
6T723r^-SEQ ID NO: 679 SEQ ID^NO: 680 91/31 ^SEQ ID NO: 681 



AGC GCC 


AAA 


ACG 


ATC 


AGC 


CCT 


TCG 


CGG 


ACA 


TGT CAG 


CAC 


CCG 


CCT 


TGG 










ser ala 


lys 


thr 


ile 


ser 


pro 


ser 


arg 


thr 


cys gin 


his 


pro 


pro 


trp 


pro 


gly 


glu 


arg 


121/41 


















151/51 


















CGT CGT 


GAC 


CGT 


GCT 


GTC 


ACC 


ACG 


TCT 


GGT 


TAG GCT 


CGG 


GGC 


GCG 


GGC 


TGG 


CGC 


GGA 


GGA 


arg arg 


asp 


arg 


ala 


val 


thr 


thr 


ser 


gly)AMB(ala 


arg 


gly 


ala 


gly 


trp 


arg 


gly 


gly 


181/61 


















211/fr^ 


SEQ 


ID NO: 682 










GGT GTG 


TTG 


CGG 


AGG 


AGG 


TGT 


GTT 


GTA 


GTG 


GGG ACG 


GCG 


GAT 


CGG 


CCG 


TTG 


GAC 


GCC 


TCG 


gly val 


leu 


arg 


arg 


arg 


cys 


val 


val 


val 


gly thr 


ala 


asp 


arg 


pro 


leu 


asp 


ala 


ser 


241/81 


















271/91 


















GCC TTG 


CGG 


GAC 


TGG 


GCA 


CAC 


GCC 


GTC 


GTC 


AGC GAT 


C 
















ala leu 


arg 


asp 


trp 


ala 


his 


ala 


val 


val 


ser asp) 




























SEQ ID NOSS678- 


682 






























FIGo 41 A 


















1/1„ — SEQ ID NO: 


: 683 










31/11 


















CTG GCC 


GGG 


ACG 


CCT 


ACG 


TGT 


AGC 


CCG 


CGG 


CTA GCA 


CAG 


GAT 


AGC 


CAT 


TGT 


TGT 


GCG 


GTA 


(leu ala 


gly 


thr 


pro 


thr 


cys 


ser 


pro 


arg 


leu ala 


gin 


asp 


ser 


his 


cys 


cys 


ala 


val 


61/21^SEQ ID NO: 684 










91/31 


















GCG CCA 


AAA 


CGA 


TCA 


GCC 


CTT 


CGC 


GGA 


CAT 


GTC AGC 


ACC 


CGC 


CTT 


GGC 


CGG 


GAG 


AGC 


GGC 


ala pro 


lys 


arg 


ser 


ala 


leu 


arg 


gly 


his 


val ser 


thr 


arg 


leu 


gly 


arg 


glu 


ser 


gly 


121/41 


















151/51 


















GTC GTG 


ACC 


GTG 


CTG 


TCA 


CCA 


CGT 


CTG 


GTT 


AGG CTC 


GGG 


GCG 


CGG 


GCT 


GGC 


GCG 


GAG 


GAG 


val val 


thr 


val 


leu 


ser 


pro 


arg 


leu 


val 


arg leu 


gly 


ala 


arg 


ala 


gly 


ala 


glu 


glu 


181/61 


















211/71 


















GTG TGT 


TGC 


GGA 


GGA 


GGT 


GTG 


TTG 


TAG 


TGG 


GGA CGG 


CGG 


ATC 


GGC 


CGT 


TGG 


ACG 


CCT 


CGG 


val cys 


cys 


gly 


gly 


gly 


val 


leu) AMB (trp 


gly arg 


arg 


ile 


gly 


arg 


trp 


thr 


pro 


arg 


241/81 








SEQ 


ID NO: i 


585-^ 


271/91 


















CCT TGC 


GGG 


ACT 


GGG 


CAC 


ACG 


CCG 


TCG 


TCA 


GCG ATC 


















pro cys 


gly 


thr 


gly 


his 


thr 


pro 


ser 


ser 


ala ile) 

















SEQ ID NOSs 683-685 



FIG. 41 B 



1/J^,SEQ ID NO: 686 

TGG CCG GGA CGC CTA CGT GTA GCC CGC 
( trp p ro gly arg leu arg val ala arg 
61/21 ^SEQ ID NO: 687 

CGC CAA AAC GAT CAG CCC TTC GCG GAC 
( arg gin asn asp gin pro phe ala asp 
121/41^SEQ ID NO: 689 

TCG TGA CCG TGC TGT CAC CAC GTC TGG 
ser)OPA(pro cys cys his his val trp 
181/61 ^ — SEQ ID NO: 690 

TGT GTT GCG GAG GAG GTG TGT TGT AGT 
cys val ala glu glu val cys cys ser 
241/81 

CTT GCG GGA CTG GGC ACA CGC CGT CGT 
leu ala gly leu gly thr arg arg arg 



31/11 

GGC TAG CAC AGG ATA GCC ATT GTT GTG CGG TAG 
gly)AMB(his arg ile ala ile val val arg)AMB 

91/31°^-SEQ ID NO: 688 
ATG TCA GCA CCC GCC TTG GCC GGG AGA GCG GCG 
met ser ala pro ala leu ala gly arg ala ala 

151/51 

TTA GGC TCG GGG CGC GGG CTG GCG CGG AGG AGG 
leu gly ser gly arg gly leu ala arg arg arg 
211/71 

GGG GAC GGC GGA TCG GCC GTT GGA CGC CTC GGC 
gly asp gly gly ser ala val gly arg leu gly 

271/91 
CAG CGA TC 
gin arg) 



SEQ ID NOSs686-690 

FIG. 41 C 



Coding sequence Rv2975c predicted by Cole et al, 1998 (Nature 393: 537-544) and 
containing seq41A 



1/1^^-SEQ ID NO: 691 

gtg ggg acg gcg gat egg ccg ttg gac gec 
( val g ly thr ala asp arg pro leu asp ala 
61/21^SEQ ID NO: 692 

gtc age gat ctg ate etc cac ate gac gag 
val ser asp leu ile leu his ile asp glu 
121/41 

gac tec gat acc ggc gtc aac atg ctg ttc 
asp ser asp thr gly val asn inet leu phe 
181/61 

ttg cac gcg aat teg cag get gac gec gaa 
leu his ala asn ser gin ala asp ala glu 
241/81 

gec ggc gcg cgt tga 
ala gly ala arg)OPA 



31/11 

teg gee ttg egg gac tgg gca cac gec gtc 
ser ala leu arg asp trp ala his ala val 
91/31 

ate aac egg etc aat gtg ttc ccg gtc get 
ile asn arg leu asn val phe pro val ala 
151/51 

acc atg cgt gee gcg gtc gta gaa get gat 
thr met arg ala ala val val glu ala asp 
211/71 

gac gtg gcg egg gtt gcg gee get etc gcg 
asp val ala arg val ala ala ala leu ala 



SEQ ID NOSs 691-692 



FIG. 41 D 



ORF according to Cole et al, 1998 (Nature 393: 537-544) and containing Rv2975c 



1/1^-SEQ ID NO: 


693 










31/11 


















L- Q-y y l. 


egg 


ggc 


cirri 


y y 


tgg 


cgc 


gga 


gga 


ggt- ytg 


i_ i— y 


cgy 


cayy 


ciyy 


i_y t_ 


y u u 


gta 


y *-y 


ill 11J \ GL -i- CL 


arg 


gly 


ala 


y j 


t rp 


arg 


y j-_y 


y y 


rr~l "v \7P=i 1 


1 eu 


arg 


arg 


arg 


cys 


val 


val 


val 


61 / 2 1*°" — 


— SEQ ID 


NO: 


694 










91/31 


















yyy ci<~y 


gcg 


gat 


rrrrr 


ccg 


t- t-rr 




gec 


teg 


y \— i l. i— y 


oner 
<~y y 


gac 


L.y y 


gc a 


cae 


gec 


gtc 


gtc 


gly thr 


ala 


asp 


p» TTI 


pro 


1 eu 


asp 


ala 


ser 


=1 "1 "1 C»l I 

C* J- CI JL w KJL 


arg 


asp 


trp 


ala 


his 


ala 


val 


val 


121/41 


















151/51 


















age gat 


ctg 


ate 


etc 


cac 


ate 


gae 


y ay 


ate 


a <TTn 


etc 


aat 


y i-y 


ttc 




cr t e 




gac 


ser asp 


leu 


ile 


leu 


his 


ile 


asp 


glu 


ile 


asn arg 


leu 


asn 


val 


phe 


pro 


val 


ala 


asp 


181/61 


















211/71 


















tec gat 


acc 


ggc 


gtc 


aac 


atg 


ctg 


ttc 


acc 


atg cgt 


gec 


gcg 


gtc 


gta 


gaa 


get 


gat 


ttg 


ser asp 


thr 


gly 


val 


asn 


met 


leu 


phe 


thr 


met arg 


ala 


ala 


val 


val 


glu 


ala 


asp 


leu 


241/81 


















271/91 


















cac gcg 


aat 


teg 


cag 


get 


gac 


gee 


gaa 


gac 


gtg gcg 


egg 


gtt 


gcg 


gec 


get 


etc 


gcg 


gee 


his ala 


asn 


ser 


gin 


ala 


asp 


ala 


glu 


asp 


val ala 


arg 


val 


ala 


ala 


ala 


leu 


ala 


ala 


301/101 




































ggc gcg 


cgt 


tga 
































gly ala 


arg)0PA 

































SEQ ID NOSs693-694 

FIG. 41 E 



sequence Rv 2974C predicted by Cole et al. (Nature 393:537-544) and which may be 
in the same reading frame as Seq41D. The sequencing of this region reveals, in 
one case out of three, a deletion of two nucleotides putting in phase observed 
in 



1/1__SEQ ID NO: 695 










31/11 


















ttg aac 


gga 


get 


cgc 


ggc 


aac 


tec 


ggc 


gtg 


ate ctg 


tec 


cag 


ate 


ctg 


cgc 


ggg 


ate 


gca 


(leu asn 


gly 


ala 


arg 


gly 


asn 


ser 


gly 


val 


ile leu 


ser 


gin 


ile 


leu 


arg 


gly 


ile 


ala 


61/21 - 


SEQ ID 


NO: 


696 










91/31 


















gag gtg 


acc 


gcg 


act 


gcg 


gee 


gec 


gee 


tct 


ggc gcg 


gta 


ttg 


egg 


gcg 


gtc 


gac 


gee 


aac 


glu val 


thr 


ala 


thr 


ala 


ala 


ala 


ala 


ser 


gly ala 


val 


leu 


arg 


ala 


val 


asp 


ala 


asn 


121/41 


















151/51 


















gee etc 


ggg 


gee 


gcg 


ttg 


tgg 


cgc 


ggc 


gtc 


gag ttg 


gtc 


gtc 


gcg 


teg 


atg 


ggt 


ggc 


gtg 


ala leu 


gly 


ala 


ala 


leu 


trp 


arg 


gly 


val 


glu leu 


val 


val 


ala 


ser 


met 


gly 


gly 


val 


181/61 


















211/71 


















gag gtg 


ccg 


gga 


act 


ate 


gtc 


teg 


gtg 


ctg 


egg gee 


gee 


gec 


gga 


gec 


gtc 


gac 


cag 


tgc 


glu val 


pro 


gly 


thr 


ile 


val 


ser 


val 


leu 


arg ala 


ala 


ala 


gly 


ala 


val 


asp 


gin 


cys 


241/81 


















271/91 


















gcg cac 


gag 


ggg 


ttg 


gec 


ggt 


gcg 


gtc 


acc 


gee gee 


ggt 


gac 


gcg 


gcg 


gtc 


ate 


gcg 


ctg 


ala his 


glu 


gly 


leu 


ala 


gly 


ala 


val 


thr 


ala ala 


gly 


asp 


ala 


ala 


val 


ile 


ala 


leu 


301/101 


















331/111 


















gaa aag 


acc 


ccc 


gaa 


cag 


ctt 


gac 


gtg 


etc 


gec gat 


gcg 


ggc 


gcg 


gtg 


gac 


gee 


ggc 


gga 


glu lys 


thr 


pro 


glu 


gin 


leu 


asp 


val 


leu 


ala asp 


ala 


gly 


ala 


val 


asp 


ala 


gly 


gly 



SEQ ID NOSs 695-696 



FIG. 41 F 




fil / 1 91 

v-> -L / X X 


















391/131 




















c tg 


ctg 


att 


ctg 


ctg 


gac 


crccr 


ttg 


cgc tec 


acc 


ate 


tgc 


ggg 


cag 


gca 


cc t 


gcc 


dx y y-"-y 


leu 


leu 


val 


leu 


leu 


asp 


ala 


leu 


arg ser 


thr 


ile 


CVS 


crlv 


gin 


ala 


pro 


ala 


421 / 141 


















451/151 


















*-yy y^*y 


y U( ~ 


tac 


gaa 


ccc 


tCCf 


cca 


cgc 


crccr 


ttg ccg 


acc 


gac 


acg 


get 


acc 


caa 


cgc 


ccc 


d J_ y cl la 


val 


i_yx 


y -L U- 


pro 


ser 


pro 


arg 


ala 


leu pro 


thr 


asp 


thr 


ala 


thr 


gin 


arg 


pro 


*± O X / X \J X 


















511/171 


















CYpp r~> /*"• fr 

y ^— y 


caa 


ttc 


y ciy 


rri~ cr 

y *-y 


atg 


tat 


ctg 


ttg 


crccr crta 
y >— y y 0 


tgt 


gat 


get 


gca 


crccr 
a a 


crccr 
=j =j 


gac 


cag 


3.I3. pro 


gin 


7*1 Vl O 


glu 


val 


met 


uy x 


leu 


1 eu 


CL J~ CL V CL «L 


cys 


asp 


ala 


ala 


ala 


ala 


asp 


gin 


_J *± X / X O X 


















571/191 


















l. i— y y 


gat 


Lya 


etc 


aag 


gaa 


ttg 


yy L - 


cracr 

y cxy 


tea crtcr 

i_\_y y t_y 


gcc 


ate 


acc 


act 


ere t 


cccr 


ccc 


gac 


leu arg 


asp 


arg 


1 eu 


xyt> 


rrl 1 1 
y x u. 


1 eu 


y ± y 


glu 


ser val 


ala 


ile 


ala 


ala 


ala 


pro 


pro 


asp 


£01 /901 

DUX/ ^ V X 


















631/211 


















cLCfC tdC 


tec 


y ^-ci 


cac 


gtc 


cac 


acc 


gac 


gac 


ncc crcrt 

y^ 1 - yy L - 


gcc 


gcc 


crtcr 


gaa 


gcc 


crcra 


tta 


aca 


ssr tyr 


ser 


VCtX 


hi s 


val 




thr 


a sp 


asp 


1 =5 rr 1 \r 

cl -L ci y -•- J 


ala 


ala 


val 


glu 


ala 


crlv 


leu 


ala 


fifil / 9 9 1 

D O X / Z, j£ 1. 


















691/231 


















y*-g ygg 


cga 


gtt 


age 


egg 


^ t-r- 


y *-y 


ate 


y 


gcg etc 


crcrt 
y y »- 


tec 


crcrcr 
y =j y 


acc 


age 


croa 


tta 


ccg 


Val gly 


arg 


Val 


ser 


arg 


1 1p 

J Lc 


V d X 


J — LC 


ser 


CLXQ X C Li. 


y - L jy 


ser 


y y 


thr 


ser 


crlv 

y x_y 


leu 


pro 


"7 9 1 / 9 A 1 


















7 51/2 51 


















QTCC ggt 


ggc 


tgg 


acg 


egg 


ggc 


cgc 




gtg 


t-L-y y^-y 


gtc 


gtc 


gac 


crcrc 


gac 


crcrt 
yy 


gcc 


gcc 


did. gly 


gly 


t rp 


L.X1X 


arg 


gly 


arg 


CLXd 


val 


leu ala 


va 1 


val 


asp 


crlv 
y jr 


asp 


crlv 


ala 


ala 




















811/271 


















y dy C L.y 


+-+-,-. 


gcc 


y"9*9" 


gag 


ggc 


gcc 


t gc 


rrt~ cr 

y *-y 




eecr 


crcrt 
yy L - 


cca 


gac 


gcc 


crtcr 


aca 


ccg 


yiu leu 


phG 


GlX CL 


gly 


glu 


gly 


ala 


t,y& 


va 1 


leu arg 


pro 


rrl \/ 


pro 


asp 


ala 


val 


thr 


pro 


841/281 


















871/291 


















gcc gcc 


gat 


ate 


ag t 


gcc 


cac 


cag 


c tg 


cr tcr 
y ^-y 


egg gcc 


gtg 


gta 


crac 


acc 


ggc 


gcc 


gcg 


cac 


ala ala 


asp 


X X C 


ser 


ala 


hi s 


gin 


1 eu 


va 1 


arg ala 


val 


val 


asp 


thr 


crlv 


ala 


ala 


his 


901/301 


















931/311 


















gtg atg 


gtg 


ctg 


eee 


aa t 


crcrc 


tat 


cr tcr 
y u y 


gcc 


gcc gaa 


gaa 


c to 


crtcr 


gcc 


ggg 


tgt 


acc 


gcg 


val met 


val 


1 GU 


pro 


asn 


y -Ljr 


i_y x 


val 


ala 


ala glu 


glu 


leu 


val 


ala 


crlv 


cys 


thr 


ala 


961/321 


















991/331 


















gcg ate 


ggc 




y y 


gtc 


gac 


at a 
y u y 


gta 


ccc 


gtg ccg 


acc 


crcra 


tea 


atg 


gtg 


cag 


ggg 


ttg 


ala ile 


gly 




y J-y 


val 


asp 


val 


va 1 


pro 


val pro 


thr 


crlv 


ser 


met 


val 


gin 


gly 


leu 


1021/341 
















1051/351 
















gcc gcg 


ctg 




cri~ cr 
y *-y 


cat 


gac 


crccr 
y^y 


gcc 


cgc 


cag gcc 


gtc 


aac 


gac 


ggc 


tac 


age 


atg 


gcc 


ala ala 


leu 


Cl x d 


•trn 1 

v ct X 


hi s 


asp 


ala 


ala 


arg 


gin ala 


val 


asp 


asp 


crlv 


tyr 


ser 


met 


ala 


1081/361 
















1111/371 
















cgt gcc 


gcc 


yy L 


gc t 


tec 


^yy 


cac 


crcra 
yy a 


teg 


gtg cgc 


att 


gcc 


acc 


caa 


aag 


gcg 


ctg 


acc 


arg ala 


ala 


y -Ly 


ala 


ser 


arg 


hi s 


cr 1 v 

y x_y 


ser 


val arg 


ile 


ala 


thr 


gin 


lys 


ala 


leu 


thr 


1141/381 
















1171/391 
















tgg gcc 


ggt 


acc 


t gc 


aag 


ccg 


crcrc 
yy^- 


gac 


crcrt 


ctg ggt 


ate 


gcg 


ggc 


gac 


gag 


gtg 


ctg 


ate 


trp ala 


gly 


thr 


cys 


lvs 


pro 


crlv 


asp 


crlv 

3 Jr 


leu gly 


ile 


ala 


gly 


asp 


glu 


val 


leu 


ile 


1201/401 
















1231/411 
















gtc gcc 


gac 


gat 


gtc 


gcc 


gcg 


gcg 


gcc 


ate 


ggt ctg 


gtc 


gac 


ctg 


ttg 


ttg 


gca 


teg 


gga 


val ala 


asp 


asp 


val 


ala 


ala 


ala 


ala 


ile 


gly leu 


val 


asp 


leu 


leu 


leu 


ala 


ser 


gly 


1261/421 
















1291/431 
















ggc gat 


ctg 


gtg 


acg 


gtg 


eta 


att 


ggc 


gcc 


ggc gta 


acc 


gaa 


gac 


gtg 


get 


gtc 


gtc 


ctg 


gly asp 


leu 


val 


thr 


val 


leu 


ile 


gly 


ala 


gly val 


thr 


glu 


asp 


val 


ala 


val 


val 


leu 


1321/441 
















1351/451 
















gaa egg 


cat 


gtg 


cac 


gac 


cac 


cat 


cca 


ggc 


acc gag 


ctg 


gtc 


tec 


tac 


cgc 


acc 


gga 


cac 


glu arg 


his 


val 


his 


asp 


his 


his 


pro 


gly 


thr glu 


leu 


val 


ser 


tyr 


arg 


thr 


gly 


his 


1381/461" 
















1411/471 
















cgc ggc 


gac 


gcg 


ctg 


ctg 


ate 


ggg 


gtc 


gag 


tag 


















arg gly 


asp 


ala 


leu 




ile 


gly val 


glu) AMB 



















SEQ ID NOS; 695-696 (continued) 



FIG. 41 F (continu d) 



SEQ ID NO: 697: first reading frame nucleic acid 
SEQ ID NO: 702: second reading frame nucleic acid 
SEQ ID NO: 717: third reading frame nucleic acid 

Seq41T comprising seq 41F and seq 41S__ — - SEQ ID NO: 698 
1/J^SEQ ID NO: 697,702,717 — "31/ 11 

t ta ggc teg ggg^ege ggg ctg/ gcg egg agg agg tgt gtt gcg gag gag gtg tgt tgt agt 
leu gly ser gly arg gly leu ala arg arg arg cys val ala glu glu val cys cys ser 
AMB (ala arg gly ala gly trp arg gly gly gly val leu arg arg arg cys val val val 
( arg *\leu gly ala arg ala gly ala glu glu val cys cys gly gly gly val leu) AMB (trp 

6l72l>vSEQI D^ NO: 703 ^ SEQ ID NO: 718 91/31 SEQ ID NO: 719 

ggg gac ggc gga teg gec gtt gga cgc etc ggc ctt gcg gga ctg ggc aca cgc cgt cgt 
gly asp gly gly ser ala val gly arg leu gly leu ala gly leu gly thr arg arg arg 
gly thr ala asp arg pro leu asp ala ser ala leu arg asp trp ala his ala val val 
gly arg arg ile gly arg trp thr pro arg pro cys gly thr gly his thr pro ser ser 
121/41 151/51 

cag cga tct gat cct cca cat cga cga gat caa ccg get caa tgt gtt ccc ggt cgc tga 
gin arg ser asp pro pro his arg arg asp gin pro ala gin cys val pro gly arg)OPA 
ser asp leu ile leu his ile asp glu ile asn arg leu asn val phe pro val ala asp 
ala ile)OPA(ser ser thr ser thr arg ser thr gly ser met cys ser arg ser leu thr 
181/61 SEQ ID NO: 720 211/71 

etc cga tac egg cgt caa cat get gtt cac cat gcg tgc cgc ggt cgt aga age tga ttt 
(leu arg tyr arg arg gin his ala val his his ala cys arg gly arg arg ser)OPA(phe 
sefNasp thr gly val asn met leu phe thr met arg ala ala val val glu ala asptleu 
pro\ile pro ala ser thr cys cys ser pro cys val pro arg ser)AMB(lys leu ile) cys 
241/81 SEQ ID NO: 699 271/91 SEQ ID NO: 721^^SEQ ID NO: 700 

gca cgc gaa ttc gca ggc tga cgc cga aga cgt ggc gcg ggt tgc ggc cgc tct cgc ggc 
ala arg glu phe ala gly)OPA(arg arg arg arg gly ala gly cys gly arg ser arg gly 
his ala asn ser gin ala asp^ala glu asp val ala arg val ala ala ala leu ala ala 
thr arg ile arg arg leu thr) pro lys thr trp arg gly leu arg pro leu ser arg pro 
301/101 SEQ ID NO: 701 — ^ 331/111 

egg cgc gcg ttg aac gga get cgc ggc aac tec ggc gtg ate ctg tec cag ate ctg cgc 
arg arg ala leu asn gly ala arg gly asn ser gly val ile leu ser gin ile leu arg 
gly ala arg)OPA(thr glu leu ala ala thr pro ala)OPA(ser cys pro arg ser cys ala 

ala arg val glu^arg ser ser arg gin leu arg arg aspV>ro val pro asp pro ala arg 
361/121 ^SEQ ID NO: 704 391/131 ^SEQ ID NO: 705 

ggg ate gca gag gtg ace gcg act gcg gee gec gee tct ggc gcg gta ttg egg gcg gtc 
gly ile ala glu val thr ala thr ala ala ala ala ser gly ala val leu arg ala val 
gly ser gin arg)OPA(pro arg leu arg pro pro pro leu ala arg tyr cys gly arg ser 

asp arg arg gly asp^arg asp cys gly arg arg leu trp arg gly ile ala gly gly arg 
421/141 ^SEQ ID NO: 706 451/151 

gac gec aac gec etc ggg gec gcg ttg tgg cgc ggc gtc gag ttg gtc gtc gcg teg atg 
asp ala asn ala leu gly ala ala leu trp arg gly val glu leu val val ala ser met 
thr pro thr pro ser gly pro arg cys gly ala ala ser ser trp ser ser arg arg trp 
arg gin arg pro arg gly arg val val ala arg arg arg val gly arg arg val asp gly 
481/161 511/171 

ggt ggc gtg gag gtg ccg gga act ate gtc teg gtg ctg egg gee gee gec gga gec gtc 
gly gly val glu val pro gly thr ile val ser val leu arg ala ala ala gly ala val 
val ala trp arg cys arg glu leu ser ser arg cys cys gly pro pro pro glu pro ser 
trp arg gly gly ala gly asn tyr arg leu gly ala ala gly arg arg arg ser arg arg 
541/181 571/191 

gac cag tgc gcg cac gag ggg ttg gee ggt gcg gtc acc gee gee ggt gac gcg gcg gtc 
asp gin cys ala his glu gly leu ala gly ala val thr ala ala gly asp ala ala val 

thr ser ala arg thr arg gly trp pro val arg ser pro pro pro val thr arg arg ser 
pro val arg ala arg gly val gly arg cys gly his arg arg arg)OPA(arg gly gly his 
601/201 631/211 SEQ ID NO: 722-^ 

ate gcg ctg gaa aag acc ccc gaa cag ctt gac gtg etc gee gat gcg ggc gcg gtg gac 
ile ala leu glu lys thr pro glu gin leu asp val leu ala asp ala gly ala val asp 

ser arg trp lys arg pro pro asn ser leu thr cys ser pro met arg ala arg trp thr 
arg ala gly lys asp pro arg thr ala)OPA( arg al a arg arg cys gly arg gly gly arg 
661/221 • 69f/231^SEQ ID NO: 723 

gee ggc gga egg ggc ctg ctg gtt ctg ctg gac gcg ttg cgc tec acc ate tgc ggg cag 
ala gly gly arg gly leu leu val leu leu asp ala leu arg ser thr ile cys gly gin 
pro ala asp gly ala cys trp phe cys trp thr arg cys ala pro pro ser ala gly arg 
arg arg thr gly pro ala gly ser ala gly arg val ala leu his his leu arg ala gly 

SEQ ID NOSs 697-727 



FIG. 41 G 



••721/241 ' 751/251 

gca cct gcc egg gcg gtc tac gaa ccc teg ccg cgc gcg ttg ccg ace gac acg get ace 
ala pro ala arg ala val tyr glu pro ser pro arg ala leu pro thr asp thr ala thr 
his leu pro gly arg ser thr asn pro arg arg ala arg cys arg pro thr arg leu pro 
thr cys pro gly gly leu arg thr leu ala ala arg val ala asp arg his gly tyr pro 
781/261 811/271 

caa cgc ccc gcc ccg caa ttc gag gtg atg tat ctg ttg gcg gta tgt gat get gca gcg 
gin arg pro ala pro gin phe glu val met tyr leu leu ala val cys asp ala ala ala 
asn ala pro pro arg asn ser arg)OPA(cys ile cys trp arg tyr val met leu gin arg 

thr pro arg pro ala ile arg gly/asp val ser val gly gly met)OPA{cys cys ser gly 

841/281 SEQ ID NO: 707 871/291 SEQ ID NO: 724 0 

gcg gac cag ttg egg gat cga etc aag gaa ttg ggt gag teg gtg gcc ate gcc get get 
ala asp gin leu arg asp arg leu lys glu leu gly glu ser val ala ile ala ala ala 
arg thr ser cys gly ile asp ser arg asn trp val ser arg trp pro ser pro leu leu 

gly pro val ala gly ser thr gin gly ile gly)OPA(val gly gly his arg arg cys ser 
901/301 931/311 SEQ ID NO: 725 

ccg ccc gac age tac tec gta cac gtc cac acc gac gac gcc ggt gcc gcc gtg gaa gcc 
pro pro asp ser tyr ser val his val his thr asp asp ala gly ala ala val glu ala 
arg pro thr ala thr pro tyr thr ser thr pro thr thr pro val pro pro trp lys pro 

ala arg gin leu leu arg thr arg pro his arg arg arg arg cys arg arg gly ser arg 
961/321 991/331 

gga ttg gcg gtg ggg cga gtt age egg ate gtg ate teg gcg etc ggt tec ggg acc age 
gly leu ala val gly arg val ser arg ile val ile ser ala leu gly ser gly thr ser 

asp trp arg trp gly glu leu ala gly ser)OPA(ser arg arg ser val pro gly pro ala 
ile gly gly gly ala ser)AMB(pro asp arg asp\leu gly ala arg phe arg asp gin arg 

1021/341 SEQ ID NO: 726--* 1051/351^— SEQ ID NO: 708 

gga ttg ccg gcc ggt ggc tgg acg egg ggc cgc gcc gtg ctg gcg gtc gtc gac ggc gac 
gly leu pro ala gly gly trp thr arg gly arg ala val leu ala val val asp gly asp 

asp cys arg pro val ala gly arg gly ala ala pro cys trp arg ser ser thr ala thr 
ile ala gly arg trp leu asp ala gly pro arg arg ala gly gly arg arg arg arg arg 
1081/361 1111/371 

ggt gcc gcc gag ctg ttc gcc ggg gag ggc gcc tgc gtg ctg cga ccg ggt cca gac gcc 
gly ala ala glu leu phe ala gly glu gly ala cys val leu arg pro gly pro asp ala 
val pro pro ser cys ser pro gly arg ala pro ala cys cys asp arg val gin thr pro) 
cys arg arg ala val arg arg gly gly arg leu arg ala ala thr gly ser arg arg arg 
1141/381 1171/391 

gtg aca ccg gcc gcc gat ate agt gcc cac cag ctg gtg egg gcc gtg gta gac acc ggc 
val thr pro ala ala asp ile ser ala his gin leu val arg ala val val asp thr gly 
OPA(his arg pro pro ile ser val pro thr ser trp cys gly pro trp)AMB(thr pro ala 

asp thr\gly arg arg tyr gin cys pro pro ala gly ala gly arg gly arglhis arg arg 

1201/401 ^ SEQ ID NO: 709 1231/411 SEQ ID NO: 710 ^ 

gcc gcg cac gtg atg gtg ctg ccc aat ggc tat gtg gcc gcc gaa gaa ctg gtg gcc ggg 
ala ala his val met val leu pro asn gly tyr val ala ala glu glu leu val ala gly 
pro arg thr)OPA(trp cys cys pro met ala met trp pro pro lys asn trp trp pro gly 

arg ala arg asplgly ala ala gin trp leu cys gly arg arg arg thr gly gly arg val 
1261/421 SEQ ID NO: 711 1291/431 

tgt acc gcg gcg ate ggc tgg ggc gtc gac gtg gta ccc gtg ccg acc gga teg atg gtg 
cys thr ala ala ile gly trp gly val asp val val pro val pro thr gly ser met val 
val pro arg arg ser ala gly ala ser thr trp tyr pro cys arg pro asp arg trp cys 
tyr arg gly asp arg leu gly arg arg arg gly thr arg ala asp arg ile asp gly ala 
1321/441 1351/451 

cag ggg ttg gcc gcg ctg gcc gtg cat gac gcg gcc cgc cag gcc gtc gac gac ggc tac 
gin gly leu ala ala leu ala val his asp ala ala arg gin ala val asp asp gly tyr 
arg gly trp pro arg trp pro cys met thr arg pro ala arg pro ser thr thr ala thr 
gly val gly arg ala gly arg ala)OPA(arg gly pro pro gly arg arg arg arg leu gin 
1381/461 SEQ ID NO: 727 * 1411/471 

age atg gcc cgt gcc gcc ggt get tec egg cac gga teg gtg cgc att gcc acc caa aag 
ser met ala arg ala ala gly ala ser arg his gly ser val arg ile ala thr gin lys 
ala trp pro val pro pro val leu pro gly thr asp arg cys ala leu pro pro lys arg 
his gly pro cys arg arg cys phe pro ala arg ile gly ala his cys his pro lys gly 



.SE0 ID NOSs 697-727 (continued 1) 

FIG. 41 G (continued 1) 




1441/481 1471/491 

gcg ctg acc tgg gcc ggt acc tgc aag ccg ggc gac ggt ctg ggt ate gcg ggc gac gag 
ala leu thr trp ala gly thr cys lys pro gly asp gly leu gly ile ala gly asp glu 
arg)OPA(pro gly pro val pro ala ser arg ala thr val trp val ser arg ala thr arg 
ala aspfcleu gly arg tyr leu gin ala gly arg arg ser gly tyr arg gly arg arg gly 
1501/501 SEQ ID NO: 712 1531/511 

gtg ctg ate gtc gcc gac gat gtc gcc gcg gcg gcc ate ggt ctg gtc gac ctg ttg ttg 
val leu ile val ala asp asp val ala ala ala ala ile gly leu val asp leu leu leu 
cys)OPA(ser ser pro thr met ser pro arg arg pro ser val trp ser thr cys cys trp 
ala aspVarg arg arg arg cys arg arg gly gly his arg ser gly arg pro val val gly 
1561/521 SEQ ID NO: 713 1591/531 

gca teg gga ggc gat ctg gtg acg gtg eta att ggc gcc ggc gta acc gaa gac gtg get 
ala ser gly gly asp leu val thr val leu ile gly ala gly val thr glu asp val ala 
his arg glu ala ile trp)OPA arg cys OCH(leu ala pro ala)OCH(pro lys thr trp leu 
ile gly arg arg ser gly asp gly ala/asn trp arg arg arg asrAarg arg arg gly cys 
1621/541 SEQ ID NO: 714-—"^^ 1651/551 SEQ ID NO: 715 

gtc gtc ctg gaa egg cat gtg cac gac cac cat cca ggc acc gag ctg gtc tec tac cgc 
val val leu glu arg his val his asp his his pro gly thr glu leu val ser tyr arg 
ser ser trp asn gly met cys thr thr thr ile gin ala pro ser trp ser pro thr ala 
arg pro gly thr ala cys ala arg pro pro ser arg his arg ala gly leu leu pro his 
1681/561 1711/571 
acc gga cac cgc ggc gac gcg ctg ctg ate ggg gtc gag tag 
thr gly his arg gly asp ala leu leu ile gly val glu) AMB 
pro asp thr ala ala thr arg cys)OPA(ser gly ser) ser 
arg thr pro arg arg arg ala ala aspJ arg gly arg val 
SEQ ID NO: 716-^ 



SEQ ID NOSs 697-727 (continued 2) 
FIGo 41 G (continued 2) 



-i£L^- — 


-SEQ 


ID NO: 728 










31/11 


















GCC GGT 


AAC 


GCC 


GCG 


TCC 


CAG 


TGC 


TAT 


CCG 


TCC GCC 


GGA 


CCG 


CCC 


GAA 


ACA 


TCA 


GCG 


GCG 


(ala gly 


asn 


ala 


ala 


ser 


gin 


cys 


tyr 


pro 


ser ala 


gly 


pro 


pro 


glu 


thr 


ser 


ala 


ala 


6l72lT"^- 


-SEQ 


ID NO: 729 










91/31 


















GGC GCC 


CCG 


GTC 


GGC 


CGC 


GGC 


CGG 


GCT 


CGA 


CCC GCT 


CCA 


CCT 


GGC 


CAT 


CAG 


CGA 


CCA 


GGT 


gly ala 


pro 


val 


gly 


arg 


gly 


arg 


ala 


arg 


pro ala 


pro 


pro 


gly 


his 


gin 


arg 


pro 


gly 


121/41 


















151/51 


















TAT CGA 


GGT 


GGA 


AGC 


GGA 


CGG 


TGT 


TGG 


GAT 


GCA CGC 


CCA 


ACT 


TGC 


CGG 


CGA 


TCG 


CGG 


CGA 


tyr arg 


gly 


gly 


ser 


gly 


arg 


cys 


trp 


asp 


ala arg 


pro 


thr 


cys 


arg 


arg 


ser 


arg 


arg 


181/61 


















211/71 


















TGC TCA 


TCG 


GAA 


CCC 


GCG 


ACG 


CAC 


ACA 


ATG 


CCC GCA 


GCA 


CCG 


CAC 


GAC 


GGC 


GCC 


CCA 


CCG 


cys ser 


ser 


glu 


pro 


ala 


thr 


his 


thr 


met 


pro ala 


ala 


pro 


his 


asp 


gly 


ala 


pro 


pro 


241/81 


















271/91 


















GCT CTT 


GCA 


GTG 


ACC 


TGA 


TGA 


TGA 


CAC 


TCA 


CCC CCA 


TAA 


GGC 


TCG 


TCG 


GCT 


GCG 


CCT 


GAG 


ala leu 


ala 


val 


thr)OPA 


OPA 


OPA (his 


ser 


pro pro)OCH(gly 


ser 


ser 


ala 


ala 


pro 


glu 


301/101 






SEQ 


ID NO:730 — * 






331/111 




^-SEQ ID NO 


: 731 






CAA TGC 


AGT 


AAG 


TTT 


ACA 


CAA 


ACG 


GAC 


TTG 


TAA AAA 


CCT 


GCG 


GAG 


GTG 


GGG 


TCT 


ATG 


GCC 


gin cys 


ser 


lys 


phe 


thr 


gin 


thr 


asp 


leu)OCH(lys 


pro 


ala 


glu 


val 


gly 


ser 


met 


ala 


361/121 


















391/131 


^SEQ 


ID NO: ' 


732 










AAC AAA 


CGT 


GGC 


AAT 


GCC 


GGG 


CAG 


CCT 


CTG 


CCC TTG 


TCG 


GAT 


C 












asn /lys 


arg 


gly 


asn 


ala 


gly 


gin 


pro 


leu 


pro leu 


ser 


asp) 













SEQ ID MOSS728-732 



FIG. 42A 



1/1^ SEQ 


ID NO: 733 












31/11 


















CCG GTA 


ACG 


CCG 


CGT 


CCC 


AGT 


GCT 


ATC 


CGT 


CCC CCC 

V_V_VJ V_V_VJ 


GAC 


CGC 


CCG 


AAA 


CAT 


CAG 


CGG 


CGG 


[pro val 


thr 


pro 


arg 


pro 


ser 


ala 


ile 


ci ±- y 




asp 


arg 


pro 




his 


gin 


arg 


arg 


61/21^SEQ ID NO: 734 










91/31 


















GCG CCC 


CGG 


TCG 


GCC 


GCG 


CCC 


CCC 


CTC 


CAC 

VJ.tt.V_ 


CCC CTC 
V_ V — \J v^. x v_ 


CAC 

V_.tt.V_ 


CTC 

V_ i. V3 


GCC 


ATC 


AGC 


GAC 


CAG 


GTT 


ala pro 


arg 


ser 


ala 


ala 


ala 


gly 


1 eu 


asp 


pro leu 


his 


1 eu 


ala 


ile 


ser 


asp 


gin 


va 1 


121/41 


















151/51 


















ATC GAG 


GTG 


GAA 


GCG 


GAC 


GGT 


GTT 


GGG 


ATG 


CAC CCC 


CAA 


CTT 


GCC 


GGC 


GAT 


CGC 


GGC 


GAT 


ile glu 


val 


glu 


ala 


asp 


y J-y 


val 


y -Ly 


me t 




gin 


1 eu 


ala 


y x y 


asp 


arg 


y x j 


asp- 


181/61 


















211/71 


















GCT CAT 


CGG 


AAC 


CCG 


CGA 


CGC 


ACA 


CAA 


TGC 


CCG CAG 


CAC 


CGC 


ACG 


ACG 


GCG 


CCC 


CAC 


CGG 


ala his 


arg 


asn 


pro 


arg 


arg 


thr 


gin 


cys 


pro gin 


his 


arg 


thr 


thr 


ala 


pro 


his 


arg 


241/81 


















271/91 


















CTC TTG 


CAG 


TGA 


CCT 


GAT 


GAT 


GAC 


ACT 


CAC 


CCC CAT 


AAG 


GCT 


CGT 


CGG 


CTG 


CGC 


CTG 


AGC 


leu leu 


gin) OPA(pro 


asp 


asp 


asp 


thr 


his 


pro his 


lys 


ala 


arg 


arg 


leu 


arg 


leu 


ser 


301/101 






X SEQ ID 


NO: 


735 






331/111 


















AAT GCA 


GTA 


AGT 


TTA 


CAC 


AAA 


CGG 


ACT 


TGT 


AAA AAC 


CTG 


CGG 


AGG 


TGG 


GGT 


CTA 


TGG 


CCA 


asn ala 


val 


ser 


leu 


his 


lys 


arg 


thr 


cys 


lys asn 


leu 


arg 


arg 


trp 


gly 


leu 


trp 


pro 


361/121 


















391/131 


















ACA AAC 


GTG 


GCA 


ATG 


CCG 


GGC 


AGC 


CTC 


TGC 


CCT TGT 


CGG 


ATC 














thr asn 


val 


ala 


met 


pro 


gly 


ser 


leu 


cys 


pro cys 


arg 


ile) 













SEQ ID NOSS733-735 

FIGo 42B 



31/11 

TCC GTC CGC CGG ACC GCC CGA AAC ATC AGC GGC GGG 
ser val arg arg thr ala arg asn ile ser gly gly 
91/31 

TCG ACC CGC TCC ACC TGG CCA TCA GCG ACC AGG TTA 
ser thr arg ser thr trp pro ser ala thr arg leu 
151/51 

GGA TGC ACG CCC AAC TTG CCG GCG ATC GCG GCG ATG 
gly cys thr pro asn leu pro ala ile ala ala met 
211/71 

AAT GCC CGC AGC ACC GCA CGA CGG CGC CCC ACC GGC 
asn ala arg ser thr ala arg arg arg pro thr gly 
271/91 

CTC ACC CCC ATA AGG CTC GTC GGC TGC GCC TGA GCA 
leu thr pro ile arg leu val gly cys ala)OPA ala 
331/111 

CTT GTA AAA ACC TGC GGA GGT GGG GTC TAT GGC CAA 
leu val lys thr cys gly gly gly val tyr gly gin 

391/131 
TCT GCC CTT GTC GGA TC 
ser ala leu val gly) 



SEQ ID NOSs736-738 

FIG. 42C 



1 /1 ^ SEQ ID NO: 736 

CGG TAA CGC CGC GTC CCA GTG CTA 

arg OCH(arg arg val pro val leu 

61/21 ^SEQ ID NO: 737 

CGC CCC GGT CGG CCG CGG CCG GGC 

arg pro gly arg pro arg pro gly 

121/41 

TCG AGG TGG AAG CGG ACG GTG TTG 
ser arg trp lys arg thr val leu 
181/61 

CTC ATC GGA ACC CGC GAC GCA CAC 
leu ile gly thr arg asp ala his 
241/81 

TCT TGC AGT GAC CTG ATG ATG ACA 
ser cys ser asp leu met met thr 
301/101 

ATG CAG TAA GTT TAC ACA AAC GGA 
met gin 0CH(val tyr thr asn gly 
361/121 ^SEQ ID NO: 738 

CAA ACG TGG CAA TGC CGG GCA GCC 
gin thr trp gin cys arg ala ala 




Coding sequence Rv2622 predicted by Cole et al . , 1998 (Nature 393:537-544) 
and containing seq42A: 



1/1^- SEQ ID 


NO: 


739 












31/11 


















atg gcc 


aac 


aaa 


cgt 


ggc 


aat 


gcc 


ggg 


cag 


cct ctg 


ccc 


ttg 


teg 


gat 


cga 


gac 


gac 


gac 


(Met ala 


asn 


lys 


arg 


gly 


asn 


ala 




gin 


pro leu 


pro 


leu 


ser 


asp 


arg 


asp 


asp 


asp 


61/21 " 


^SEQ ID 


NO: 


740 










91/31 


















cac atg 


cag 


crcrcf 


cac 


tgg 


CtQ 


ctcr 


gcc 


ccrcr 
*-* y y 


Ctcr crcrc 


aaa 


ccrcr 


ata 

y -J 


eta 


cgt 


ccc 


ggc 


ggc 


his met 


gin 


crlv 
y Jr 


his 


trp 


leu 


leu 


ala 


arcr 


leu crlv 


Ivs 


arcr 


val 


leu 


ara 


pro 


gly 


gly 


121/41 


















151/51 


















gtc gaa 


c tc 


acc 


ccrcr 


aca 


ctcr 


ctg 


gcc 


cgc 


ctcc craa 


ata 

y •—y 


acc 


aac 


gcc 


aac 


ata 

y y 


etc 


gag 


val rr 1 i ] 


leu 


thr 


arcr 


thr 


leu 


leu 


ala 


arg 


ala glu 


val 


thr 


asp 


ala 


asp 


val 


leu 


alu 

y v -*- 


181/61 


















211/71 


















ctg gca 


ccg 


crcrc: 
y y 


ctg 


crac 
y y v - 


cgc 


acc 


gca 


gcc 


gaa ate 


ttg 


gcc 


cgc 


aac 


ccg 


caa 
y y 


teg 


tac 


~\ f^U Fi 1 ri 

KJL JL CA 


pro 


y j-y 


leu 


crlv 


arg 


thr 


ala 


ala 


glu ile 


leu 


ala 


ara 


asn 


pro 


arg 


ser 


tvr 


241/81 


















271/91 


















rr\~ a aaa 
y "-y yyy 


y *-y 


cm cr 

y ciy 


age 


gat 


ccc 


aac 


crccr 
y *—y 


gcc 


aac ctg 


gtc 


cga 


cac 


att 


etc 


gcc 


aac 
y y *— 


cgc 


val gly 


ala 


glu 


ser 


asp 


pro 


asn 


ala 


ala 


asn leu 


val 


arg 


his 


val 


leu 


ala 


alv 


ara 


301/101 


















331/111 


















aac aac 


gtc 


ccrcr 
*—y y 


crtc 


acc 


gac 


crccr 

y *-*y 


gcc 


gat 


acc crcra 


tta 


tec 


gac 


gcc 


age 


gcc 


gat 


gtc 


C~T~\ \r p\ cn 


val 


arg 


val 


thr 


asp 


ala 


ala 


asp 


thr gly 


leu 


ser 


asp 


ala 


ser 


ala 


asp 


val 


361/121 


















391/131 


















gtc ate 


crcrc 


aaa 
y a y 


crccr 
y y 


atg 


ctg 


acc 


atg 


caa 


ggc aac 


crccr 
y y 


get 


aaa 


cac 


acg 


ate 


ate 


gcc 


va 1 i 1 e 


y j-y 


glu 


ala 


met 


leu 


thr 


met 


crln 


gly asn 


ala 


ala 


lys 


his 


thr 


ile 


val 


ala 


421/141 


















451/151 


















era. ci crccr 
y^y y^*y 


crccr 
y '—y 


ccrcr 


cr tcr 
y L y 


c tcr 


acrcr 
°yy 


ccg 


crcrfc 
yy ( - 


crcrc 
y y v - 


cgc tac 


crccr 
y-y 


att 


cac 


gaa 


eta 


aca 
y ^ y 


eta 


ata 
y y 


rrlu ala 


ala 


arg 


val 


leu 


arg 


pro 


crlv 

y Jr 


crlv 


arg tyr 


ala 


ile 


his 


crlu. 


leu 


ala 


leu 


val 


481/161 


















511/171 


















ccg gac 


gac 


cr tc 


gca 


cracr 

y *-*y 


cag 


crtc 


cere 


acc 


gac ctg 


egg 


cag 


teg 


ctg 


gcc 


cgc 


gcg 


etc 


pro asp 


asp 


val 


ala 


crlu 

_1_ VJ, 


crln 


val 


arcr 


thr 


asp leu 


arg 


gin 


ser 


leu 


ala 


arg 


ala 


leu 


541/181 


















571/191 


















aag gtc 


aat 


gcg 


cgt 


ccg 


ctg 


acc 


gtt 


crccr 
y y 


craa tcrcr 

y <-*■*■* y y 


tea 


cac 


etc 


tta 


gcg 


ggc 


cat 


gga 


lys val 


asn 


ala 


arg 


pro 


leu 


thr 


val 


ala 


glu trp 


ser 


his 


leu 


leu 


ala 


gly 


his 


gly 


601/201 


















631/211 


















ctg gtc 


gtc 


gaa 


cac 


gtt 


gtc 


acc 


get 


tec 


atg gcg 


ttg 


tta 


caa 


ccg 


cga 


egg 


gtg 


ate 


leu val 


val 


glu 


his 


val 


val 


thr 


ala 


ser 


met ala 


leu 


leu 


gin 


pro 


arg 


arg 


val 


ile 


661/221 


















691/231 


















get gac 


gaa 


ggc 


etc 


ctg 


ggt 


gcg 


ctg 


egg 


ttc gcc 


gga 


aac 


ctg 


etc 


ate 


cat 


cgt 


gcc 


ala asp 


glu 


gly 


leu 


leu 


gly 


ala 


leu 


arg 


phe ala 


gly 


asn 


leu 


leu 


ile 


his 


arg 


ala 


721/241 


















751/251 


















gcg cgt 


egg 


cga 


gtc 


ctg 


ttg 


atg 


cgc 


cac 


aca ttc 


cgc 


agg 


cat 


cgt 


gaa 


cgc 


ttg 


aca 


ala arg 


arg 


arg 


val 


leu 


leu 


met 


arg 


his 


thr phe 


arg 


arg 


his 


arg 


glu 


arg 


leu 


thr 


781/261 


















811/271 


















gcc gtc 


gcc 


att 


gtc 


gcg 


cac 


aaa 


ccg 


cac 


gtc gat 


teg 


tga 














ala val 


ala 


ile 


val 


ala 


his 


lys 


pro 


his 


val asp 


ser)OPA 















SEQ ID MOSs739-740 



FiGo 42D 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv2622 



1/J1__SEQ ID NO: 741 
taa aaa cct gcg gag gtg ggg 
OCH(lys pro ala glu val gly 
61/2 SEQ ID NO: 742 

ccc ttg teg gat cga gac gac 
pro leu ser asp arg asp asp 
121/41 

aag egg gtg ctg cgt ccc ggc 
lys arg val leu arg pro gly 
181/61 

gtg acc gac gec gac gtg etc 
val thr asp ala asp val leu 
241/81 

ttg gee cgc aac ccg egg teg 
leu ala arg asn pro arg ser 
301/101 

gtc cga cac gtt etc gee ggc 
val arg his val leu ala gly 
361/121 

tta tec gac gec age gee gat 
leu ser asp ala ser ala asp 
421/141 

gcg get aaa cac acg ate gtc 
ala ala lys his thr ile val 
481/161 

gcg att cac gaa eta gcg ctg 
ala ile his glu leu ala leu 
541/181 

egg cag teg ctg gee cgc gcg 
arg gin ser leu ala arg ala 
601/201 

teg cac etc tta gcg ggc cat 
ser his leu leu ala gly his 
661/221 

ttg tta caa ccg cga egg gtg 
leu leu gin pro arg arg val 
721/241 

gga aac ctg etc ate cat cgt 
gly asn leu leu ile his arg 
781/261 

cgc agg cat cgt gaa cgc ttg 

arg arg his arg glu arg leu 

841/281 

teg tga 

ser)OPA 









31/11 






tct 


a tg 


gee 




cert 

i_y 


aac 
yy 


ser 


me t 


ala 


cx cz> 1 1 x_y& 


d J- y 


y -Lj 








91/31 






gac 


cac 


a t g 


v-ciy yyy 


cac 




asp 


his 


me t 


y xi i y iy 


his 


trr> 

L-X 








-1 C -1 / C -1 






ggc 


gtc 


gaa 


v - i — ^ - d \— v_ 


prrrr 


aca 


gly 


VCt _L 


glu 


leu thr 


arg 


L.IXX 








211/71 






yay 


ctg 


y \~cx 


<-<-y yy<- 


c tg 




glu 


1 eu 


aid 


pi o y ly 


1 eu 


gly 








271/91 






tac 


y *-y 


yyy 


y*-y yo-y 


age 


ga t 




va 1 


y xy 


ci x cl y 1 u 


ser 


asp 








J J X / XXX 






egc 


ggc 


gac 


/-r 4- /— * i"" IT /T 

y tyy 


gtc 


ace 


arg 


gly 


asp 


val arg 


vdx 


thr 








J-/X / X J X 






gtc 


gtc 


pi t- r* 


ggc gag 


g c y 


a Ly 


vax 


Val 


lie 


/T 1 1 r /T Til 

gxy gxu 


ala 


ine t 








Jl / X _) X 






gee 


y ciy 


y <-y 


yuy t-yy 


fit" fT 

y <-y 


ctg 


Cl -LCI 


glu 


ala 


CL X CL ax y 


val 


leu 








511/171 








ccg 


gac 


gac gtc 


gca 


cracr 


val 


pro 


asp 


Clw£«/ V -L- 


ala 


glu 








571 / 191 






etc 


aag 


gtc 


aat crccf 


cgt 


ccg 


1 eu 


lys 


val 


ClOll CLXd 


arg 


pro 








631/211 






gga 


ctg 


gtc 


gtc gaa 


cac 


gtt 


gly 


leu 


val 


val glu 


his 


val 








691/231 






ate 


get 


gac 


gaa ggc 


etc 


ctg 


ile 


ala 


asp 


glu gly 


leu 


leu 








751/251 






gee 


gcg 


cgt 


egg cga 


gtc 


ctg 


ala 


ala 


arg 


arg arg 


val 


leu 








811/271 






aca 


gee 


gtc 


gec att 


gtc 


gcg 


thr 


ala 


val 


ala ile 


val 


ala 



SEQ ID NOSs 741-742 

FIG. 42E 



aat gee ggg cag cct ctg 
asn ala gly gin pro leu 

ctg ctg gee egg ctg ggc 
leu leu ala arg leu gly 

ctg ctg gee cgc gec gag 
leu leu ala arg ala glu 

cgc acc gca gee gaa ate 
arg thr ala ala glu ile 

ccc aac gcg gee aac ctg 
pro asn ala ala asn leu 

gac gcg gee gat acc gga 
asp ala ala asp thr gly 

ctg acc atg caa ggc aac 
leu thr met gin gly asn 

agg ccg ggt ggc cgc tac 
arg pro gly gly arg tyr 

cag gtc cgc acc gac ctg 
gin val arg thr asp leu 

ctg acc gtt gcg gaa tgg 
leu thr val ala glu trp 

gtc acc get tec atg gcg 
val thr ala ser met ala 

ggt gcg ctg egg ttc gee 
gly ala leu arg phe ala 

ttg atg cgc cac aca ttc 
leu met arg his thr phe 

cac aaa ccg cac gtc gat 
his lys pro his val asp 




1/1 ^SEQ ID NO: 743 31/11 



ate gcg 


cgt 


gac 


ate 


gat 


gac 


cag 


ggt 


egg 


ctg tgt 


ctg 


gac 


gtc 


ggc 


ggt 


cga 


acg 


gta 


(ile ala 


arg 


asp 


ile 


asp 


asp 


gin 


gly 


arg 


leu cys 


leu 


asp 


val 


gly 


gly 


arg 


thr 


val 


61/21" - 


— SEQ ID 


NO: 


744 










91/31 


















gtt gtt 


tea 


gcg 


ggc 


gac 


gtg 


gtg 


cat 


ttg 


cgt taa 


etc 


gcg 


egg 


age 


tgg 


cgt 


ccc 


caa 


val val 


ser 


ala 


gly 


asp 


val 


val 


his 


leu 


arg)OCH(leu 


ala 


arg 


ser 


trp 


arg 


pro 


gin 


121/41 


















151/51 


^SEQ ID 


NO: 


745 










aag att 


aag 


gt c 


gcg 


ggc 


atg 


age 


tat 


ccg 


gag aat 


gtc 


ctg 


gec 


get 


ggc 


gag 


cag 


gtc 


lys ile 


lys 


val 


ala 


gly 


met 


ser 


tyr 


pro 


glu asn 


val 


leu 


ala 


ala 


gly 


glu 


gin 


val 


181/61 


















211/71 


















gtt ctg 


cac 


cgc 


„ 4- 

cac 


ccg 


cac 


tgg 


aat 


cgc 


tta ate 


tgg 


ccc 


gtc 


gtg 


gtg 


ctg 


y i— w 


t ta 


val leu 


his 


arg 


his 


pro 


his 


trp 


asn 


arg 


leu ile 


trp 


pro 


val 


val 


val 


leu 


val 


leu 


241/81 


















271/91 


















ctg acc 


ggg 


ttg 


gcg 


gcg 


ttc 


ggg 


tec 


gga 


ttc gtc 


aac 


teg 


aca 


cct 


tgg 


cag 


cag 


ate 


leu thr 


gly 


leu 


ala 


ala 


phe 


gly 


ser 


gly 


phe val 


asn 


ser 

- 


thr 


pro 


trp 


gin 


gin 


ile) 














SEQ ID NOSs743- 


745 






























FIG. 43A 


















1/l^SEQ ID 


NO: 


746 












31/11 


















teg cgc 


gtg 


aca 


teg 


atg 


acc 


agg 


gtc 


ggc 


tgt gtc 


tgg 


acg 


teg 


gcg 


gtc 


gaa 


egg 


tag 


( ser arg 


val 


thr 


ser 


met 


thr 


arg 


val 


gly 


cys val 


trp 


thr 


ser 


ala 


val 


glu 


arg) AMB 


61/21^SEQ ID NO; 


: 747 










91/31 


















ttg ttt 


cag 


egg 


gcg 


acg 


tgg 


tgc 


att 


tgc 


gtt aac 


teg 


cgc 


gga 


get 


ggc 


gtc 


ccc 


aaa 


(leu phe 


gin 


arg 


ala 


thr 


trp 


cys 


ile 


cys 


val asn 


ser 


arg 


gly 


ala 


gly 


val 


pro 


lys 


121/41 


SEQ ID 


NO: 


748 










151/51 


















aga tta 


agg 


teg 


egg 


gca 


tga 


get 


ate 


egg 


aga atg 


tec 


tgg 


ccg 


ctg 


gcg 


age 


agg 


teg 


arg leu 


arg 


ser 


arg 


ala)OPA(ala 


ile 


arg 


arg met 


ser 


trp 


pro 


leu 


ala 


ser 


arg 


ser 


181/61 


SEQ ID NO: 749 — ; 








211/71 


















ttc tgc 


acc 


gee 


ate 


cgc 


act 


gga 


ate 


get 


taa tct 


ggc 


ccg 


teg 


tgg 


tgc 


tgg 


tct 


tgc 


phe cys 


thr 


ala 


ile 


arg 


thr 


gly 


ile 


ala)OCH(ser 


gly 


pro 


ser 


trp 


cys 


trp 


ser 


cys) 


241/81 


















271/91 


SEQ ID 


NO: 


750 










tga ccg 


ggt 


tgg 


egg 


cgt 


teg 


ggt 


ccg 


gat 


teg tea 


act 


cga 


cac 


ctt 


ggc 


age 


aga 


tc 


OPA(pro 


gly 


trp 


arg 


arg 


ser 


gly 


pro 


asp 


ser ser 


thr 


arg 


his 


leu 


gly 


ser 


arg) 



SEQ ID NO: 751 



SEQ ID NOSs746-751 

FIG. 43B 



1/1____SEQ ID 


NO: 


752 












31/11 


















cgc gcg tga 


cat 


cga 


tga 


cca 


ggg 


teg 


get 


gtg tct 


gga 


cgt 


egg 


egg 


teg 


aac 


ggt 


agt 


arg ala OPA 


his 


arg 


OPA (pro 


gly 


ser 


ala 


val ser 


gly 


arg 


arg 


arg 


ser 


asn 


gly 


ser 


61/21 SEQ ID NO: 753~^ 










91/31 


















tgt ttc age 


ggg 


cga 


cgt 


ggt 


gca 


ttt 


gcg 


tta act 


cgc 


gcg 


gag 


ctg 


gcg 


tec 


cca 


aaa 


cys phe ser 


gly 


arg 


arg 


gly 


ala 


phe 


ala 


leu thr 


arg 


ala 


glu 


leu 


ala 


ser 


pro 


lys 


121/41 
















151/51 


















gat taa ggt 


cgc 


ggg 


cat 


gag 


cca 


tec 


gga 


gaa tgt 


cc t 


ggc 


cgc 


tgg 


cga 


gca 


ggt 


cgc 


asp)OCH(gly 


arg 


gly 


his 


glu 


leu 


ser 


gly 


glu cys 


pro 


gly 


arg 


trp 


arg 


ala 


gly 


arg 


181/61 ! 


SEQ ID NO: 754 








211/71 


















tct gca ccg 


cca 


tec 


gca 


ctg 


gaa 


teg 


ctt 


aat ctg 


gee 


cgt 


cgt 


ggt 


get 


ggt 


ctt 


get 


ser ala pro 


pro 


ser 


ala 


leu 


glu 


ser 


leu 


asn leu 


ala 


arg 


arg 


gly 


ala 


gly 


leu 


ala 


241/81 
















271/91 


















gac egg gtt 


ggc 


ggc 


gtt 


egg 


gtc 


egg 


att 


cgt caa 


etc 


gac 


ace 


ttg 


gca 


gca 


gat 


c 


asp arg val 


gly 


gly 


val 


arg 


val 


arg 


ile 


arg gin 


leu 


asp 


thr 


leu 


ala 


ala 


asp) 





SEQ ID NOSs 752-754 

FIG. 43C 



Coding sequence Rv3278c predicted by Cole et al . , 1998 (Nature 393:537-544) 
and containing seq43A: 



1/1__SEQ 


! ID 


NO: 


755 












31/11 


















atg age 


tat 


ccg 


gag 


aat 


gtc 


ctg 


gec 


get 


ggc gag 


cag 


gtc 


gtt 


ctg 


cac 


cgc 


cat 


ccg 


(Met ser 


tyr 


pro 


glu 


asn 


val 


leu 


ala 


ala 


gly glu 


gin 


val 


val 


leu 


his 


arg 


his 


pro 


6i/2l"^SEQ ID NO: 756 










91/31 


















cac tgg 


aat 


cgc 


tta 


ate 


tgg 


ccc 


gtc 


gtg 


gtg ctg 


gtc 


ttg 


ctg 


ace 


ggg 


ttg 


gcg 


gcg 


his trp 


asn 


arg 


leu 


ile 


trp 


pro 


val 


val 


val leu 


val 


leu 


leu 


thr 


gly 


leu 


ala 


ala 


121/41 


















151/51 


















ttc ggg 


tec 


gga 


ttc 


gtc 


aac 


teg 


aca 


cct 


tgg cag 


cag 


ate 


get 


aag 


aac 


gtg 


att 


cac 


phe gly 


ser 


gly 


phe 


val 


asn 


ser 


thr 


pro 


trp gin 


gin 


ile 


ala 


lys 


asn 


val 


ile 


his 


181/61 


















211/71 


















gcg gtc 


ate 


tgg 


ggg 


ate 


tgg 


ttg 


gtg 


ate 


gtc ggc 


tgg 


etc 


acg 


ctg 


tgg 


cca 


ttc 


ctg 


ala val 


ile 


trp 


gly 


ile 


trp 


leu 


val 


ile 


val gly 


trp 


leu 


thr 


leu 


trp 


pro 


phe 


leu 


241/81 


















271/91 


















age tgg 


ctg 


ace 


aca 


cat 


ttc 


gtg 


gtg 


ace 


aac egg 


egg 


gtg 


atg 


ttc 


egg 


cat 


ggt 


gtg 


ser trp 


leu 


thr 


thr 


his 


phe 


val 


val 


thr 


asn arg 


arg 


val 


met 


phe 


arg 


his 


gly 


val 


301/101 


















331/111 


















ctg ace 


cgc 


age 


ggg 


ate 


gac 


ata 


ccg 


eta 


gca egg 


ate 


aac 


age 


gtg 


gag 


ttc 


egg 


gac 


leu thr 


arg 


ser 


gly 


ile 


asp 


ile 


pro 


leu 


ala arg 


ile 


asn 


ser 


val 


glu 


phe 


arg 


asp 


361/121 


















391/131 


















egg ate 


ttc 


gag 


egg 


att 


ttt 


cgc 


ace 


ggg 


acg ttg 


att 


ate 


gag 


tec 


gcg 


tea 


caa 


gat 


arg ile 


phe 


glu 


arg 


ile 


phe 


arg 


thr 


gly 


thr leu 


ile 


ile 


glu 


ser 


ala 


ser 


gin 


asp 


421/141 


















451/151 


















ccg etc 


gag 


ttc 


tac 


aac 


att 


ccg 


cgc 


ctg 


egg gag 


gtg 


cat 


gcg 


ttg 


ctg 


tat 


cac 


gag 


pro leu 


glu 


phe 


tyr 


asn 


ile 


pro 


arg 


leu 


arg glu 


val 


his 


ala 


leu 


leu 


tyr 


his 


glu 


481/161 


















511/171 


















gtt ttc 


gac 


ace 


ctg 


ggc 


tec 


gac 


gag 


teg 


ccc age 


tga 
















val phe 


asp 


thr 


leu 


gly 


ser 


asp 


glu 


ser 


pro ser) OPA 


















SEQ ID NOS 1755-756 

FIG. 43D 




ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv3278c 



1/l^SEQ ID 


NO: 


757 












31/11 


















taa etc 


gcg 


egg 


age 


tgg 


cgt 


ccc 


caa 


aag 


att aag 


gtc 


gcg 


ggc 


atg 


age 


tat 


ccg 


gag 


OCH(leu 


ala 


arg 


ser 


trp 


arg 


pro 


gin 


lys 


ile lys 


val 


ala 


gly 


met 


ser 


tyr 


pro 


glu 


61/21^-SEQ ID NO: 758 














91/31 












aat gtc 


ctg 


gee 


get 


ggc 


gag 


cag 


gtc 


gtt 


ctg cac 


cgc 


cat 


ccg 


cac 


tgg 


aat 


cgc 


tta 


asn val 


leu 


ala 


ala 


gly 


glu 


gin 


val 


val 


leu his 


arg 


his 


pro 


his 


trp 


asn 


arg 


leu 


121/41 


















151/51 


















ate tgg 


ccc 


gtc 


gtg 


gtg 


ctg 


gtc 


ttg 


ctg 


acc ggg 


ttg 


gcg 


gcg 


ttc 


ggg 


tec 


gga 


ttc 


ile trp 


pro 


val 


val 


val 


leu 


val 


leu 


leu 


thr gly 


leu 


ala 


ala 


phe 


gly 


ser 


gly 


phe 


181/61 


















211/71 


















gtc aac 


teg 


aca 


cct 


tgg 


cag 


cag 


ate 


get 


aag aac 


gtg 


att 


cac 


gcg 


gtc 


ate 


tgg 


ggg 


val asn 


ser 


thr 


P r o 


trp 


gin 


gin 


ile 


ala 


lys asn 


val 


ile 


his 


ala 


val 


ile 


trp 


gly 


241/81 


















271/91 


















ate tgg 


ttg 


gtg 


ate 


gtc 


ggc 


tgg 


etc 


acg 


ctg tgg 


cca 


ttc 


ctg 


age 


tgg 


ctg 


acc 


aca 


ile trp 


leu 


val 


ile 


val 


gly 


trp 


leu 


thr 


leu trp 


pro 


phe 


leu 


ser 


trp 


leu 


thr 


thr 


301/101 


















331/111 


















cat ttc 


gtg 


gtg 


acc 


aac 


egg 


egg 


gtg 


atg 


ttc egg 


cat 


ggt 


gtg 


ctg 


acc 


cgc 


age 


ggg 


his phe 


val 


val 


thr 


asn 


arg 


arg 


val 


met 


phe arg 


his 


gly 


val 


leu 


thr 


arg 


ser 


gly 


361/121 


















391/131 


















ate gac 


ata 


ccg 


eta 


gca 


egg 


ate 


aac 


age 


gtg gag 


ttc 


egg 


gac 


egg 


ate 


ttc 


gag 


egg 


ile asp 


ile 


pro 


leu 


ala 


arg 


ile 


asn 


ser 


val glu 


phe 


arg 


asp 


arg 


ile 


phe 


glu 


arg 


421/141 


















451/151 


















att ttt 


cgc 


ace 


ggg 


acg 


ttg 


att 


ate 


gag 


tec gcg 


tea 


caa 


gat 


ccg 


etc 


gag 


ttc 


tac 


ile phe 


arg 


thr 


gly 


thr 


leu 


ile 


ile 


glu 


ser ala 


ser 


gin 


asp 


pro 


leu 


glu 


phe 


tyr 


481/161 


















511/171 


















aac att 


ccg 


cgc 


ctg 


egg 


gag 


gtg 


cat 


gcg 


ttg ctg 


tat 


cac 


gag 


gtt 


ttc 


gac 


acc 


ctg 


asn ile 


pro 


arg 


leu 


arg 


glu 


val 


his 


ala 


leu leu 


tyr 


his 


glu 


val 


phe 


asp 


thr 


leu 



541/181 

ggc tec gac gag teg ccc age tga 
gly ser asp glu ser pro ser)OPA 



SEQ ID NOSs757-758 

FIG. 43E 




1/1_SEQ ID NO: 759 31/11 



gcc aag 


atg 


gat 


gtc 


tac 


caa 


cgc 


acc 


gcc 


gcc ggc 


tgg 


cag 


ccg 


etc 


aag 


acc 


ggt 


ate 


(alalys 


met 


asp 


val 


tyr 


gin 


arg 


thr 


ala 


ala gly 


trp 


gin 


pro 


leu 


lys 


thr 


gly 


ile 


61/21 ^ 


SEQ 


ID NO: 760 










91/31 


















acc acc 


cat 


ate 


ggt 


teg 


gcg 


ggc 


atg 


gcg 


ccg gaa 


gcc 


aag 


age 


gga 


tat 


ccg 


gcc 


act 


thr thr 


his 


ile 


gly 


ser 


ala 


gly 


met 


ala 


pro glu 


ala 


lys 


ser 


gly 


tyr 


pro 


ala 


thr 


121/41 


















151/51 


















ccg atg 


ggg 


gtt 


tac 


age 


ctg 


gac 


tec 


get 


ttt ggc 


acc 


gcg 


ccg 


aat 


ccc 


ggt 


ggc 


ggg 


pro met 


gly 


val 


tyr 


ser 


leu 


asp 


ser 


ala 


phe gly 


thr 


ala 


pro 


asn 


pro 


gly 


gly 


gly 


181/61 


















211/71 


















ttg ccg 


tat 


acc 


caa 


gtc 


gga 


ccc 


aat 


cac 


tgg tgg 


agt 


ggc 


gac 


gac 


aat 


age 


ccc 


acc 


leu pro 


tyr 


thr 


gin 


val 


gly 


pro 


asn 


his 


trp trp 


ser 


gly 


asp 


asp 


asn 


ser 


pro 


thr 


241/81 


















271/91 


















ttt aac 


tec 


atg 


cag 


gtc 


tgt 


cag 


aag 


tec 


cag tgc 


ccg 


ttc 


age 


acg 


gcc 


gac 


age 


gag 


phe asn 


ser 


met 


gin 


val 


cys 


gin 


lys 


ser 


gin cys 


pro 


phe 


ser 


thr 


ala 


asp 


ser 


glu 


301/101 


















331/111 


















aac ctg 


caa 


ate 


ccg 


cag 


tac 


aag 


cat 


teg 


gtc gtg 


atg 


ggc 


gtc 


aac 


aag 


gcc 


aag 


gtc 


asn leu 


gin 


ile 


pro 


gin 


tyr 


lys 


his 


ser 


val val 


met 


gly 


val 


asn 


lys 


ala 


lys 


val 


361/121 


















391/131 


















cca ggc 


aaa 


ggc 


tec 


gcg 


ttc 


ttc 


ttt 


cac 


acc acc 


gac 


ggc 


ggg 


ccc 


acc 


gcg 


ggt 


tgt 


pro gly 


lys 


gly 


ser 


ala 


phe 


phe 


phe 


his 


thr thr 


asp 


gly 


gly 


pro 


thr 


ala 


gly 


cys 


421/141 




































gtg gcg 


ate 


































val ala 


ile) 












































SEQ ID BIOS s 759- 


760 






























FIG. 44A 


















1/l^SEQ ID 


NO: 


761 












31/11 


















cca aga 


tgg 


atg 


tct 


acc 


aac 


gca 


ccg 


ccg 


ccg get 


ggc 


age 


cgc 


tea 


aga 


ccg 


gta 


tea 


(pro arg 


trp 


met 


ser 


thr 


asn 


ala 


pro 


pro 


pro ala 


gly 


ser 


arg 


ser 


arg 


pro 


val 


ser 


6T72i^- 


-SEQ 


ID NO: 762 










91/31 


















cca ccc 


ata 


teg 


gtt 


egg 


egg 


gca 


tgg 


cgc 


egg aag 


cca 


aga 


gcg 


gat 


ate 


egg 


cca 


etc 


pro pro 


ile 


ser 


val 


arg 


arg 


ala 


trp 


arg 


arg lys 


pro 


arg 


ala 


asp 


ile 


arg 


pro 


leu 


121/41 


















151/51 


















cga tgg 


ggg 


ttt 


aca 


gcc 


tgg 


act 


ccg 


ctt 


ttg gca 


ccg 


cgc 


cga 


ate 


ccg 


gtg 


gcg 


ggt 


arg trp 


gly 


phe 


thr 


ala 


trp 


thr 


pro 


leu 


leu ala 


pro 


arg 


arg 


ile 


pro 


val 


ala 


gly 


181/61 


















211/71 


















tgc cgt 


ata 


ccc 


aag 


teg 


gac 


cca 


ate 


act 


ggt gga 


gtg 


gcg 


acg 


aca 


ata 


gcc 


cca 


cc t 


cys arg 


ile 


pro 


lys 


ser 


asp 


pro 


ile 


thr 


gly gly 


val 


ala 


tnr 


tnr 


i ±e 


axa 


pro 


pro 


241/81 


















271/91 


















tta act 


cca 


tgc 


agg 


tct 


gtc 


aga 


agt 


ccc 


agt gcc 


cgt 


tea 


gca 


egg 


ccg 


aca 


gcg 


aga 


leu thr 


pro 


cys 


arg 


ser 


val 


arg 


ser 


pro 


ser ala 


arg 


ser 


ala 


arg 


pro 


thr 


ala 


arg 


301/101 


















331/111 


















acc tgc 


aaa 


tec 


cgc 


agt 


aca 


age 


att 


egg 


teg tga 


tgg 


gcg 


tea 


aca 


agg 


cca 


agg 


tec 


thr cys 


lys 


ser 


arg 


ser 


thr 


ser 


ile 


arg 


ser)OPA(trp 


ala 


ser 


thr 


arg 


pro 


arg 


ser 


361/121 


















391/131 


SEQ ID 


NO: 


763 










cag gca 


aag 


get 


ccg 


cgt 


tct 


tct 


ttc 


aca 


cca ccg 


acg 


gcg 


ggc 


cca 


ccg 


egg 


gtt 


gtg 


gin ala 


lys 


ala 


pro 


arg 


ser 


ser 


phe 


thr 


pro pro 


thr 


ala 


gly 


pro 


pro 


arg 


val 


val 



421/141 
tgg cga tc 
trp arg) 



SEQ ID NOSs761-763 



FIG. 44B 




1/1_SEQ ID NO: 764 31/11 

caa gat gga tgt eta cca acg cac cgc cgc egg ctg gca gee get caa gac egg tat cac 
( gin a sp gly cys leu pro thr his arg arg arg leu ala ala ala gin asp arg tyr his 
6T72T^SEQ ID NO: 765 91/31 

cac cca tat egg ttc ggc ggg cat ggc gee gga age caa gag egg ata tec ggc cac tec 
his pro tyr arg phe gly gly his gly ala gly ser gin glu arg ile ser gly his ser 
121/41 151/51 

gat ggg ggt tta cag cct gga etc cgc ttt tgg cac cgc gec gaa. tec egg tgg egg gtt 
asp gly gly leu gin pro gly leu arg phe trp his arg ala glu ser arg trp arg val 
181/61 211/71 

gee gta tac cca agt egg ace caa tea ctg gtg gag tgg cga cga caa tag ccc cac ctt 
ala val tyr pro ser arg thr gin ser leu val glu trp arg arg gin) AMB pro his leu 
241/81 271/91 

taa etc cat gca ggt ctg tea gaa gtc cca gtg ccc gtt cag cac ggc cga cag cga gaa 
OCH(leu his ala gly leu ser glu val pro val pro val gin his gly arg gin arg glu 
301/101 ^SEQ ID NO: 766 331/111 

cct gca aat ccc gca gta caa gca ttc ggt cgt gat ggg cgt caa caa ggc caa ggt ccc 
pro ala asn pro ala val gin ala phe gly arg asp gly arg gin gin gly gin gly pro 
361/121 391/131 

agg caa agg etc cgc gtt ctt ctt tea cac cac cga egg egg gee cac cgc ggg ttg tgt 
arg gin arg leu arg val leu leu ser his his arg arg arg ala his arg gly leu cys 
421/141 
ggc gat c 
gly asp) 



SEQ ID NOSs 764-766 



FIG. 44C 




Coding sequence Rv0309 predicted by Cole et al , 
and containing Seq44A: 



1998 (Nature 393:537-544) 



1/1___SEQ ID NO: 767 

aEg age cga etc eta get ttg ctg tgc 
( Met se r arg leu leu ala leu leu cys 
61/21""" — SEQ ID NO: 7 68 
etc gcg cca gtg age ctg gee gtc gtc 
leu ala pro val ser leu ala val val 
121/41 

act cag gtg gtt teg gtg gtg gga acc 
thr gin val val ser val val gly thr 
181/61 

cgc acc gee gec ggc tgg cag ccg etc 
arg thr ala ala gly trp gin pro leu 
241/81 

ggc atg gcg ccg gaa gee aag age gga 
gly met ala pro glu ala lys ser gly 
301/101 

gac tec get ttt ggc acc gcg ccg aat 
asp ser ala phe gly thr ala pro asn 
361/121 

ccc aat cac tgg tgg agt ggc gac gac 
pro asn his trp trp ser gly asp asp 
421/141 

cag aag tec cag tgc ccg ttc age acg 
gin lys ser gin cys pro phe ser thr 
481/161 

aag cat teg gtc gtg atg ggc gtc aac 
lys his ser val val met gly val asn 
541/181 

ttc ttt cac acc acc gac ggc ggg ccc 
phe phe his thr thr asp gly gly pro 
601/201 

ctg gtg cag ate ate cgt tgg ctg egg 
leu val gin ile ile arg trp leu arg 



31/11 

get gcg gta tgc acg ggc tgc gtt get gtg gtt 
ala ala val cys thr gly cys val ala val val 
91/31 

aac ccg tgg ttc gcg aac teg gtc ggc aat gee 
asn pro trp phe ala asn ser val gly asn ala 
151/51 

ggc ggt teg acg gee aag atg gat gtc tac caa 
gly gly ser thr ala lys met asp val tyr gin 
211/71 

aag acc ggt ate acc acc cat ate ggt teg gcg 
lys thr gly ile thr thr his ile gly ser ala 
271/91 

tat ccg gee act ccg atg ggg gtt tac age ctg 
tyr pro ala thr pro met gly val tyr ser leu 
331/111 

ccc ggt ggc ggg ttg ccg tat acc caa gtc gga 
pro gly gly gly leu pro tyr thr gin val gly 
391/131 

aat age ccc acc ttt aac tec atg cag gtc tgt 
asn ser pro thr phe asn ser met gin val cys 
451/151 

gee gac age gag aac ctg caa ate ccg cag tac 
ala asp ser glu asn leu gin ile pro gin tyr 
511/171 

aag gee aag gtc cca ggc aaa ggc tec gcg ttc 
lys ala lys val pro gly lys gly ser ala phe 
571/191 

acc gcg ggt tgt gtg gcg ate gac gat gee acg 
thr ala gly cys val ala ile asp asp ala thr 
631/211 

cct ggt gcg gtg ate gcg ate gee aag taa 
pro gly ala val ile ala ile ala lys)OCH 



SEQ ID KTOSs767-768 



FIG, 44D 




ORF according to Cole et al . , 1998 



1/l^SEQ ID NO: 769 

tga gcg atg age cga etc eta get ttg 
OPA(ala met ser arg leu leu ala leu 

61/2?*" SEQ ID NO: 770 

gtg gtt etc gcg cca gtg age ctg gec 
val val leu ala pro val ser leu ala 
121/41 

aat gec act cag gtg gtt teg gtg gtg 
asn ala thr gin val val ser val val 
181/61 

tac caa cgc ace gee gee ggc tgg cag 
tyr gin arg thr ala ala gly trp gin 
241/81 

teg gcg ggc atg gcg ccg gaa gec aag 
ser ala gly met ala pro glu ala lys 
301/101 

age ctg gac tec get ttt ggc ace gcg 
ser leu asp ser ala phe gly thr ala 
361/121 

gtc gga ccc aat cac tgg tgg agt ggc 
val gly pro asn his trp trp ser gly 
421/141 

gtc tgt cag aag tec cag tgc ccg ttc 
val cys gin lys ser gin cys pro phe 
481/161 

cag tac aag cat teg gtc gtg atg ggc 
gin tyr lys his ser val val met gly 
541/181 

gcg ttc ttc ttt cac ace ace gac ggc 
ala phe phe phe his thr thr asp gly 
601/201 

gee acg ctg gtg cag ate ate cgt tgg 

ala thr leu val gin ile ile arg trp 

661/221 

taa 

OCH 



Nature 393:537-544) and containing Rv0309 



31/11 

ctg tgc get gcg gta tgc acg ggc tgc gtt get 
leu cys ala ala val cys thr gly cys val ala 
91/31 

gtc gtc aac ccg tgg ttc gcg aac teg gtc ggc 
val val asn pro trp phe ala asn ser val gly 
151/51 

gga acc ggc ggt teg acg gec aag atg gat gtc 
gly thr gly gly ser thr ala lys met asp val 
211/71 

ccg etc aag acc ggt ate acc acc cat ate ggt 
pro leu lys thr gly ile thr thr his ile gly 
271/91 

age gga tat ccg gee act ccg atg ggg gtt tac 
ser gly tyr pro ala thr pro met gly val tyr 
331/111 

ccg aat ccc ggt ggc ggg ttg ccg tat acc caa 
pro asn pro gly gly gly leu pro tyr thr gin 
391/131 

gac gac aat age ccc acc ttt aac tec atg cag 
asp asp asn ser pro thr phe asn ser met gin 
451/151 

age acg gee gac age gag aac ctg caa ate ccg 
ser thr ala asp ser glu asn leu gin ile pro 
511/171 

gtc aac aag gee aag gtc cca ggc aaa ggc tec 
val asn lys ala lys val pro gly lys gly ser 
571/191 

ggg ccc acc gcg ggt tgt gtg gcg ate gac gat 
gly pro thr ala gly cys val ala ile asp asp 
631/211 

ctg egg cct ggt gcg gtg ate gcg ate gec aag 
leu arg pro gly ala val ile ala ile ala lys) 



SEQ ID NOSs769-770 



FIGo 44E 



Cloned fragment fused with phoA 



1/1 SEQ ID NO: 771 31/11 



gat etc 


ccc 


gga 


cac 


cag 


gtc 


1- r-> 

acc 


egg 


cga 


gat ggt 


gat 


cga 


ggc 


teg 


gac 


ccg 


cag 


gca 


(asp leu 


pro 


gly 


his 


gin 


val 


ile 


arg 


arg 


asp gly 


asp 


arg 


gly 


ser 


asp 


pro 


gin 


ala 


61/21^SEQ ID NO: 772 










91/31 


















l. v_» v_ y y l 


age 


eag 


agg 


< — d v_» 


cag 


cat 


cag 


caa 


cat cgc 


gat 


ggc 


cag 


cat 


gec 


gcg 


ccg 


teg 


ser gly 


ser 


gin 


cix y 


hi s 




hi s 


gin 


gin 


his arg 


asp 


gly 


gin 


his 


ala 


ala 


pro 


ser 


191 / A 1 


















151/51 


















y y t t_ t l. 


t gc 


eac 


teg- 


cga 


tec 


ttg 


gga 


tga 


egg tgg 


ggc 


ata 


get 


age 


gcg 


cac 


cag 


gtc 


y -i-.y f 


cy s 


hi s 


ser 


arg 


ser 


1 eu 


gly)OPA(arg trp 


gly 


ile 


ala 


ser 


ala 


his 


gin 


val 


lol/ OX 










TT\ 

X.U 


jnu : 


/ / J 




911 /71 
Zll/ IX 


















ate gtg 


cca 


gac 


egg 


gca 


tgc 


cgc 


gtc 


ggc 


aag ctg 


teg 


ggc 


gcg 


ggt 


tag 


age 


ggt 


age 


ile val 


pro 


asp 


arg 


ala 


cys 


arg 


vax 


gly 


lys leu 


ser 


gly 


ala 


gly) AMB(ser 


gly 


ser 


o 1 


















271/91 


SEQ ID NO: 774^ 








gtg cga 


ccc 


agg 


atg 


gcg 


aat 


get 


egg 


ggg 


tea ccg 


gcg 


aag 


tgg 


tag 


ccg 


egg 


atg 


atg 


val airg 


pro 


arg 


met 


ala 


asn 


ala 


arg 


gly 


ser pro 


ala 


lys 


trp) AMB (pro 


arg 


met 


met 


jUX / -L U X 


















331/111 


SEQ ID 


NO: 


775- 










teg gtg 


aag 


ccc 


aac 


egg 


egg 


tac 


aac 


cgc 


cac gee 


cga 


ttg 


tec 


tca 


ccg 


ttg 


gtc 


tec 


ser val 


lys 


pro 


asn 


arg 


arg 


tyr 


asn 


arg 


his ala 


arg 


leu 


ser 


ser 


pro 


leu 


val 


ser 


TCI /I 

Jul / A- A I. 


















391/131 


















ggt gcg 


gag 


age 


agg 


acg 


ttg 


tec 


teg 


teg 


cga ccg 


get 


age 


agt 


egg 


egg 


gee 


aac 


gee 


gly val 


glu 


ser 


arg 


thr 


leu 


ser 


ser 


ser 


arg pro 


ala 


ser 


ser 


arg 


arg 


ala 


asn 


ala 


41 Z X / X 41 X 


















451/151 


















tec ccg 


agg 


cca 


egg 


cct 


tga 


gcg 


egg 


gga 


agg atg 


tgc 


aat 


tea 


gtc 


aac 


teg 


aag 


tag 




arg 


pro 


arg 


pro)OPA(ala 


arg 


gly 


arg met 


cys 


asn 


ser 


val 


asn 


ser 


lys ) AMB 


41 O X / iOl 


SEQ ID NO: 


776-^ 








511/171 


















ccg ytc 


ate 


agt 


egg 


gcg 


ate 


get 


agg 


cgc 


gga aag 


ccg 


ctg 


cgt 


tgc 


aag 


ccc 


agt 


acc 


V -L- V GL -L. 


ile 


ser 


arg 


ala 


ile 


ala 


arg 


arg 


gly lys 


pro 


leu 


arg 


cys 


lys 


pro 


ser 


thr 


541/181 


^SEQ ID 


NO: 


777 










571/191 




















tgt 


tgc 


cac 


cac 


tgg 


ccg 


ggc 


gec 


ccg gga 


tag 


ccg 


tac 


gee 


act 


ccg 


age 


att 


cm. _y £a 


cys 


cys 


his 


his 


trp 


pro 


gly 


ala 


pro gly)AMB(pro 


tyr 


ala 


thr 


pro 


ser 


ile 


fim /9m 

w VJ _L / U i. 


















631/211 




^-SEQ ID NO 


: 778 






gg^ gcg 


ttg 


etc 


agt 


teg 


gcg 


gec 


gac 


ggc 


age gee 


gtg 


gtg 


teg 


gcg 


gee 


teg 


gee 


tgt 


gly ala 


leu 


leu 


ser 


ser 


ala 


ala 


asp 


gly 


ser ala 


val 


val 


ser 


ala 


ala 


ser 


ala 


cys 




















691/231 




















gee 


gtt 


acc 


teg 


acg 


gee 


gcg 


acc 


gee tgc 


cag 


ccg 


cgc 


cgc 


egg 


atg 


tgc 


tec 


ser ala 


ala 


val 


thr 


ser 


thr 


ala 


ala 


thr 


ala cys 


gin 


pro 


arg 


arg 


arg 


met 


cys 


ser 


721/241 


















751/251 


















age cac 


att 


ggg 


gcg 


cgc 


aaa 


gtc 


teg 


gtg 


ccc ctg 


ggg 


tag 


cgc 


ate 


gcg 


teg 


aca 


tac 


ser his 


ile 


gly 


ala 


arg 


lys 


val 


ser 


val 


pro leu 


gly) AMB(arg 


ile 


ala 


ser 


thr 


tyr 


781/261 


















811/271 






^-SEQ ID NO 


: 779 




ace gtc 


agg 


gca 


tea 


ccg 


agg 


egg 


cgc 


tec 


ata teg 


ctg 


ggc 


ggc 


aga 


teg 


atg 


agg 


aat 


thr val 


arg 


ala 


ser 


pro 


arg 


arg 


arg 


ser 


ile ser 


leu 


gly 


gly 


arg 


ser 


met 


arg 


asn 


841/281 


















871/291 


















ate gee 


aac 


gcg 


egg 


tgt 


cct 


cct 


cat 


gtg 


atg aac 


cga 


tgc 


gtg 


ctt 


gcg 


cac 


cag 


tat 


ile ala 


asn 


ala 


arg 


cys 


pro 


pro 


his 


val 


met asn 


arg 


cys 


val 


leu 


ala 


his 


gin 


tyr 


901/301 


















931/311 


















egg aca 


age 


cga 


tga 


ggc 


cgc 


ccg 


cgc 


tgg 


acg ggg 


ctt 


gta 


gcg 


tat 


ggc 


cgt 


ttc 


cgc 


arg thr 


ser 


arg)OPA(gly 


arg 


pro 


arg 


trp 


thr gly 


leu 


val 


ala 


tyr 


gly 


arg 


phe 


arg 










^ SEQ ID NO 


: 780 





















SEQ ID NOSs 771-783 




961/321 

tea get cgt cgc tgc ggc gec gec ggg ata 
ser ala arg arg cys gly ala ala gly ile 
1021/341 SEQ ID NO: 782x 

gat tga cct cgt ate ate tga\ gtt agt tgc 
asp)OPA(pro arg ile ile)OPA(val ser cys 
lOSl/SSl^SEQ ID MO: 781 

att acg tga cag tct gtc ggc aag gag gga 

ile thr)OPA(gln ser val gly lys glu gly 

1141/381 ^-SEQ ID NO: 783 

atg ctt gac cag ate gag age get etc tac 

met leu asp gin ile glu ser ala leu tyr 

1201/401 

cgt ggc ggg ggc ttc cgc gca ccg ace gcg 
arg gly gly gly phe arg ala pro thr ala 
1261/421 

ate ate ggt ctg ggg atg ttg gtt tec ggc 
ile ile gly leu gly met leu val ser gly 
1321/441 

ttc ccg ata etc age gtt ttc ggt tttgtc 
phe pro ile leu ser val phe gly phe val 
1381/461 

acc ggt cct egg ttg tec ggc agg atg gat 
thr gly pro arg leu ser gly arg met asp 
1441/481 

cag cgt cgt acc aag ggg gee ggg ggc tea 
gin arg arg thr lys gly ala gly gly ser 



991/331 

gaa teg ccc gcg aac cag tgg tac ggc gca 
glu ser pro ala asn gin trp tyr gly ala 
1051/351 

ccg cgc aat ggg cat ccg cgt gtt ate ggt 
pro arg asn gly his pro arg val ile gly 
1111/371 

cgc atg cca etc tec gat cat gag cag egg 
arg met pro leu ser asp his glu gin arg 
1171/391 

gec gaa gat ccc aag ttc gca teg agt gtc 
ala glu asp pro lys phe ala ser ser val 
1231/411 

egg egg cgc ctg cag ggc gcg gcg ttg ttc 
arg arg arg leu gin gly ala ala leu phe 
1291/431 

gtg gcg ttc aaa gag acc atg ate gga agt 
val ala phe lys glu thr met ile gly ser 
1351/451 

gtg atg ttc ggt ggt gtg gtg tat gee ate 
val met phe gly gly val val tyr ala ile 
1411/471 

cgt ggc gga teg get get ggg get teg cgc 
arg gly gly ser ala ala gly ala ser arg 
1471/491 

ttc acc age cgt atg gaa gat c 
phe thr ser arg met glu asp) 



SEQ ID NOSs 771-783 (continued) 

FIG. 45ZA (continued) 



f 

fragment seq45ZA shifted minus 1 for the reading frame 
1/1__,SEQ ID NO: 784 ' 31/11 

ate tec ccg gac ace agg tea tec ggc gag atg gtg ate gag get egg ace cgc agg cat 
( ile se r pro asp thr arg ser ser gly glu met val ile glu ala arg thr arg arg his 
61/21^" SEQ ID NO: 785 91/31 

ccg gta gee aga ggc acc age ate age aac ate gcg atg gee age atg ccg cgc cgt egg 
pro val ala arg gly thr ser ile ser asn ile ala met ala ser met pro arg arg arg 
121/41 151/51 

gtc ctt gee act cgc gat cct tgg gat gac ggt ggg gca tag eta gcg cgc acc agg tea 
val leu ala thr arg asp pro trp asp asp gly gly ala)AMB(ieu ala arg thr arg ser 
181/61 211/71 ^SEQ ID NO: 786 

teg tgc cag acc ggg cat gec gcg teg gca age tgt egg gcg egg gtt aga gcg gta gcg 
ser cys gin thr gly his ala ala ser ala ser cys arg ala arg val arg ala val ala 
241/81 271/91 

tgc gac cca gga tgg cga atg etc ggg ggt cac egg cga agt ggt age cgc gga tga tgt 
cys asp pro gly trp arg met leu gly gly his arg arg ser gly ser arg gly)OPA cys 
301/101 331/111 

egg tga age cca acc ggc ggt aca acc gec acg ccc gat tgt cct cac cgt tgg tct ccg 
arg OPA(ser pro thr gly gly thr thr ala thr pro asp cys pro his arg trp ser pro 
361/121 ^SEQ ID NO: 787 391/131 

gtg tgg aga gca gga cgt tgt cct cgt cgc gac egg eta gca gtc ggc ggg cca acg cct 
val trp arg ala gly arg cys pro arg arg asp arg leu ala val gly gly pro thr pro 
421/141 451/151 

ccc cga ggc cac ggc ctt gag cgc ggg gaa gga tgt gca att cag tea act cga agt age 
pro arg gly his gly leu glu arg gly glu gly cys ala ile gin ser thr arg ser ser 
481/161 511/171 

tgg tea tea gtc ggg cga teg eta ggc gcg gaa age cgc tgc gtt gca age cca gta cca 
trp ser ser val gly arg ser leu gly ala glu ser arg cys val ala ser pro val pro 
541/181 571/191 

cct get gtt gec acc act ggc egg gcg ccc egg gat age cgt acg cca etc cga gca ttg 
pro ala val ala thr thr gly arg ala pro arg asp ser arg thr pro leu arg ala leu 
601/201 631/211 

gcg cgt tgc tea gtt egg egg ccg acg gca gcg ccg tgg tgt egg egg cct egg cct gtt 
ala arg cys ser val arg arg pro thr ala ala pro trp cys arg arg pro arg pro val 
661/221 691/231 

egg ctg ccg tta cct cga egg ccg cga ccg cct gee age cgc gee gee gga tgt get cca 
arg leu pro leu pro arg arg pro arg pro pro ala ser arg ala ala gly cys ala pro 
721/241 751/251 

gee aca ttg ggg cgc gca aag tct egg tgc ccc tgg ggt age gca teg cgt cga cat aca 
ala thr leu gly arg ala lys ser arg cys pro trp gly ser ala ser arg arg his thr 
781/261 811/271 

ccg tea ggg cat cac cga ggc ggc get cca tat cgc tgg gcg gca gat cga tga gga ata 
pro ser gly his his arg gly gly ala pro tyr arg trp ala ala asp arg)OPA(gly ile 
841/281 871/291 SEQ ID NO: 788—^ 

teg cca acg cgc ggt gtc etc etc atg tga tga acc gat gcg tgc ttg cgc acc agt ate 
ser pro thr arg gly val leu leu met)OPA OPA( thr asp ala cys leu arg thr ser ile 
901/301 931/3lT^-SEQ ID NO: 789 

gga caa gec gat gag gec gec cgc get gga egg ggc ttg tag cgt atg gee gtt tec get 
gly gin ala asp glu ala ala arg ala gly arg gly leu)AMB(arg met ala val ser ala 
961/321 991/331 SEQ ID NO: 790 

cag etc gtc get gcg gcg ccg ccg gga tag aat cgc ccg cga acc agt ggt acg gcg cag 
gin leu val ala ala ala pro pro gly)AMB(asn arg pro arg thr ser gly thr ala gin 

^ SEQ ID NO: 791 



SEQ ID NOSs784-793 



FIG. 45ZB 




1021/341 1051/351 

att gac etc gta tea tct gag tta gtt gec cgc gca atg ggc ate cgc gtg tta teg gta 
ile asp leu val ser ser glu leu val ala arg ala met gly ile arg val leu ser.val 

1081/361 1111/371 
tta cgt gac agt ctg teg gca agg agg gac gca tgc cac tct ccg ate atg age age gga 
leu arg asp ser leu ser ala arg arg asp ala cys his ser pro ile met ser ser gly 
1141/381 1171/391 

tgc ttg acc aga teg aga gcg etc tct acg ccg aag ate cca agt teg cat cga gtg tec 
cys leu thr arg ser arg ala leu ser thr pro lys ile pro ser ser his arg val ser 
1201/401 1231/411 

gtg gcg ggg get tec gcg cac cga ccg cgc ggc ggc gec tgc agg gcg egg cgt tgt tea 
val ala gly ala ser ala his arg pro arg gly gly ala cys arg ala arg arg cys ser 
1261/421 1291/431 

tea teg gtc tgg gga tgt tgg ttt ccg gcg tgg cgt tea aag aga cca tga teg gaa gtt 

ser ser val trp gly cys trp phe pro ala trp arg ser lys arg pro)OPA(ser glu val 

1321/441 1351/451 SEQ ID NO: 792^ 

tec cga tac tea gcg ttt teg gtt ttg teg tga tgt teg gtg gtg tgg tgt atg cca tea 

ser arg tyr ser ala phe ser val leu ser)OPA( cys se r val val trp cys met pro ser 

1381/461 1411/471^-SEQ ID NO: 793 

ccg gtc etc ggt tgt ccg gca gga tgg ate gtg gcg gat egg ctg ctg ggg ctt cgc gee 

pro val leu gly cys pro ala gly trp ile val ala asp arg leu leu gly leu arg ala 

1441/481 1471/491 

age gtc gta cca agg ggg ccg ggg get cat tea cca gee gta tgg aag ate 
ser val val pro arg gly pro gly ala his ser pro ala val trp lys ile) 



SEQ ID KTOSs 784-793 (continued) 
FiOo <4SZB (continued) 



fragment seq45ZA shifted minus 2 for the reading frame 



1/1^ -SEQ ID NO: 794 31/11 

tct ccc egg aca cca ggt cat ccg gcg aga tgg tga teg agg etc gga ccc gca ggc ate 
ser pr o arg thr pro gly his pro ala arg trp)OPA(ser arg leu gly pro ala gly ile 
61/21 " — SEQ ID NO: 795 91/31 ^ SEQ ID NO: 796 

egg tag cca gag gca cca gca tea gca aca teg cga tgg cca gca tgc cgc gee gtc ggg 
arg)AMB(pro glu ala pro ala ser ala thr ser arg trp pro ala cys arg ala val gly 
121/41 ^ — SEQ ID NO: 797 151/51 

tec ttg cca etc gcg ate ctt ggg atg acg gtg ggg cat age tag cgc gca cca ggt cat 
ser leu pro leu ala ile leu gly met thr val gly his ser)AMB(arg ala pro gly his 
181/61 211/71 £-SEQ ID NO: 798 

cgt gee aga ccg ggc atg ccg cgt egg caa get gtc ggg cgc ggg tta gag egg tag cgt 
arg ala arg pro gly met pro arg arg gin ala val gly arg gly leu glu arg)AMB(arg 

241/81 271/91 SEQ ID NO: 799- * 

gcg acc cag gat ggc gaa tgc teg ggg gtc ace ggc gaa gtg gta gee gcg gat gat gtc 
ala thr gin asp gly glu cys ser gly val thr gly glu val val ala ala asp asp val 
301/101 331/111 

ggt gaa gee caa ccg gcg gta caa ccg cca cgc ccg att gtc etc acc gtt ggt etc egg 
gly glu ala gin pro ala val gin pro pro arg pro ile val leu thr val gly leu arg 
361/121 391/131 

tgt gga gag cag gac gtt gtc etc gtc gcg acc ggc tag cag teg gcg ggc caa cgc etc 
cys gly glu gin asp val val leu val ala thr gly)AMB(gln ser ala gly gin arg leu 

421/141 451/151 4 SEQ ID NO: 800 

ccc gag gee acg gee ttg age gcg ggg aag gat gtg caa ttc agt caa etc gaa gta get 
pro glu ala thr ala leu ser ala gly lys asp val gin phe ser gin leu glu val ala 
481/161 511/171 

ggt cat cag teg ggc gat cgc tag gcg egg aaa gee get gcg ttg caa gee cag tac cac 
gly his gin ser gly asp arg)AMB(ala arg lys ala ala ala leu gin ala gin tyr his 
541/181 SEQ ID NO: 801—-^ 571/191 

ctg ctg ttg cca cca ctg gee ggg cgc ccc ggg ata gee gta cgc cac tec gag cat tgg 
leu leu leu pro pro leu ala gly arg pro gly ile ala val arg his ser glu his trp 
601/201 631/211 

cgc gtt get cag ttc ggc ggc cga egg cag cgc cgt ggt gtc ggc ggc etc ggc ctg ttc 
arg val ala gin phe gly gly arg arg gin arg arg gly val gly gly leu gly leu phe 
661/221 691/231 

ggc tgc cgt tac etc gac ggc cgc gac cgc ctg cca gec gcg ccg ccg gat gtg etc cag 
gly cys arg tyr leu asp gly arg asp arg leu pro ala ala pro pro asp val leu gin 
721/241 751/251 

cca cat tgg ggc gcg caa agt etc ggt gee cct ggg gta gcg cat cgc gtc gac ata cac 
pro his trp gly ala gin ser leu gly ala pro gly val ala his arg val asp ile his 
781/261 811/271 

cgt cag ggc ate acc gag gcg gcg etc cat ate get ggg egg cag ate gat gag gaa tat 
arg gin gly ile thr glu ala ala leu his ile ala gly arg gin ile asp glu glu tyr 
841/281 871/291 

cgc caa cgc gcg gtg tec tec tea tgt gat gaa ccg atg cgt get tgc gca cca gta teg 
arg gin arg ala val ser ser ser cys asp glu pro met arg ala cys ala pro val ser 
901/301 931/311 

gac aag ccg atg agg ccg ccc gcg ctg gac ggg get tgt age gta tgg ccg ttt ccg etc 
asp lys pro met arg pro pro ala leu asp gly ala cys ser val trp pro phe pro leu 



SEQ ID NOS $794-804 

FIG. 45ZC 




961/321 
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991/331 
















age teg teg 


ctg 


egg 


cgc 


cgc 


egg 


gat 


aga 


ate gee cgc 


gaa 


cca 


gtg 
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egg 
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ser ser ser 
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arg 
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arg 


ile ala arg 


glu 
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val 


val 


arg 


arg 


arg 


lUzl/ o41 
















1U 31/ jjl 
















ttg acc teg 


tat 


cat 


ctg 


agt 


tag 


ttg 


ccc 


gcg caa tgg 


gca 


tec 


gcg 


tgt 


tat 


egg 


L. CI L. 


leu thr ser 


tyr 


his 


leu 


ser ) AMB ( leu 
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ser 
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lUol/Jbl 


SEQ ID WO: 802 






1111 / ***7 1 
1111/ j / 1 
















tac gtg aca 


gtc 


tgt 


egg 


caa 


gg a 


ggg 


acg 


cat gec act 


etc 


cga 


tea 


tga 


gca 


gcg 


y a l 


tyr val thr 


val 


cys 


arg 


gin 


gly 


gly 


urix 


Il-Lo did L.I1X. 


1 eu 


arg 


ser ) OPA(ala 


ala 


asp 


11/11 / *a o 1 
1141/ Jol 
















iiti / *a q i 

11/1/ jjI 




ID NO: 803— 






get tga cca 


gat 


cga 


gag 


cgc 


L- — X- 

CCu 


-r-, f- -3 

cca 


cgc 


cga aga tec 


caa 


gtt 


cgc 


ate 
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ugc 


ccg 


ala) OPA (piro 


asp 


arg 


glu 


arg 


ser 


leu 


arg 


arg arg ser 


gin 


val 
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glu 




pro 


ioai / /i n 1 _ 
lz U 1 / 4 U 1 ^ — - 


SEQ 


ID NO: 804 








ion /yiii 
iz Jl/ 411 
















tgg egg ggg 


ctt 


ccg 


cgc 


acc 


gac 


cgc 


gcg 


gcg gcg cct 


gca 


ggg 


cgc 


ggc 


gtt 


gct 


o = t- 


trp arg gly 


leu 


pro 


arg 


thr 


asp 


arg 


aia 


ala ala pro 


ala 


gly 


arg 


gly 


val 


V U.X 


his 


1261/421 
















Izy 1 / 4ol 
















cat egg tct 


ggg 


gat 


gtt 


ggt 


ttc 


egg 


cgt 


ggc gtt caa 


aga 


gac 


cat 


gat 


egg 


aag 




his arg ser 


gly 


asp 


val 


gly 


phe 


arg 


arg 


gly val gin 


arg 


asp 


his 


asp 


arg 


lys 


phe 


1 O O 1 I A A "\ 

±3Z±/ 441 
















1 O CI / yl CI i 
















ccc gat act 


cag 


cgt 


ttt 


egg 


ttt 


tgt 


cgt 


gat gtt egg 


tgg 


tgt 


ggt 


gta 


tgc 


cat 


cac 


pro asp thr 


gin 


arg 


phe 


arg 


phe 


cys 


arg 


asp val arg 


trp 


cys 


gly 


val 


cys 


his 


his 


1381/461 
















1411/471 
















egg tec teg 


gtt 


gtc 


egg 


cag 


gat 


gga 


teg 


tgg egg ate 


ggc 


tgc 


tgg 


ggc 


ttc 


gcg 


cca 


arg ser ser 


val 


val 


arg 


gin 


asp 


gly 


ser 


trp arg ile 


gly 


cys 


trp 


gly 


phe 


ala 


pro 


1441/481 
















1471/491 
















gcg teg tac 


caa 


ggg 


ggc 


egg 


ggg 


etc 


att 


cac cag ccg 


tat 


gga 


aga 


tc 








ala ser tyr 


gin 


gly 


gly 


arg 


gly 


leu 


ile 


his gin pro 


tyr 


gly 


arg) 









SEQ ID KTOSs794-804 (continued 1) 
i3o ^0£Aj (continued 1) 



SEQ ID NO: 805 

f seq 45ZA joined directly to phoA 

cag tct gtc ggc aag gag gga cgc atg cca etc tec gat cat gag cag egg 
( gin ser v al gly lys glu gly arg met pro leu ser asp his glu gin arg 
1141/381^ SEQ ID NO: 806 1171/391 



atg ctt gac 


cag 


ate 


gag 


age 


get 


etc 


tac 


gee gaa gat 


ccc 


aag 


ttc 


gca 


teg 


agt 


gtc 


met leu asp 


gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala glu asp 


pro 


lys 


phe 


ala 


ser 


ser 


val 


1201/401 
















1231/411 
















cgt ggc ggg 


ggc 


ttc 


cgc 


gca 


ccg 


acc 


gcg 


egg egg cgc 


ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


arg gly gly 


gly 


phe 


arg 


ala 


pro 


thr 


ala 


arg arg arg 


leu 


gin 


gly 


ala 


ala 


leu 


phe 


1261/421 
















1291/431 
















ate ate ggt 


ctg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


gtg gcg ttc 


aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ile ile gly 


leu 


gly 


met 


leu 


val 


ser 


gly 


val ala phe 


lys 


glu 


thr 


met 


ile 


gly 


ser 


1321/441 
















1351/451 
















ttc ccg ata 


etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg atg ttc 


ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


phe pro ile 


leu 


ser 


val 


phe 


gly 


phe 


val 


val met phe 


gly 


gly 


val 


val 


tyr 


ala 


ile 


1381/461 
















1411/471 
















acc ggt cct 


egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt ggc gga 


teg 


get 


get 


ggg 


get 


teg 


cgc 


thr gly pro 


arg 


leu 


ser 


gly 


arg 


met 


asp 


arg gly gly 


ser 


ala 


ala 


gly 


ala 


ser 


arg 


1441/481 
















1471/491 
















cag cgt cgt 


acc 


aag 


ggg 


gee 


ggg 


ggc 


tea 


ttc acc age 


cgt 


atg 


gaa 


gat 


c 






gin arg arg 


thr 


lys 


gly 


ala 


gly 


gly 


ser 


phe thr ser 


arg 


met 


glu 


asp) 







SEQ ID NOSs805-806 



FIG. 45A 




Sequence Rv2169c predicted by Cole et al . , 1998 (Nature 393:537-544) 
and containing Seq45A 

1/JL^SEQ ID NO: 807 31/11 



atg cca 


etc 


tec 


gat 


cat 


gag 


cag 


egg 


atg 


ctt gac 


cag 


ate 


gag 


age 


get 


etc 


tac 


gee 


(Met pro 


leu 


ser 


asp 


his 


glu 


gin 


arg 


met 


leu asp 


gin 


ile 


glu 


ser 


ala 


leu 


tyr 


ala 


O ± / Z± 


' SEQ 


ID NO: 808 










y l / 3 1 


















gaa gat 


ccc 


aag 


ttc 


gca 


teg 


agt 


gtc 


cgt 


ggc ggg 


ggc 


ttc 


cgc 


gca 


ccg 


acc 


gcg 


egg 


glu asp 


pro 


lys 


pne 


ala 


ser 


ser 


val 


arg 


gly gly 


gly 


phe 


arg 


ala 


pro 


thr 


ala 


arg 


121 /4 1 


















151/51 


















egg cgc 


ctg 


cag 


ggc 


gcg 


gcg 


ttg 


ttc 


ate 


ate ggt 


ctg 


ggg 


atg 


ttg 


gtt 


tec 


ggc 


gtg 


arg arg 


leu 


gin 


gly 


ala 


ala 


leu 


phe 


, He 


ile gly 


leu 


giy 


met 


leu 


val 


ser 


giy 


val 


lol/ 61 


















211/71 


















gcg ttc 


aaa 


gag 


acc 


atg 


ate 


gga 


agt 


ttc 


ccg ata 


etc 


age 


gtt 


ttc 


ggt 


ttt 


gtc 


gtg 


ala phe 


lys 


glu 


thr 


met 


ile 


gly 


ser 


phe 


pro ile 


leu 


ser 


val 


phe 


giy 


phe 


val 


val 


241/81 


















271/91 


















atg ttc 


ggt 


ggt 


gtg 


gtg 


tat 


gee 


ate 


acc 


ggt cct 


egg 


ttg 


tec 


ggc 


agg 


atg 


gat 


cgt 


met phe 


gly 


gly 


val 


val 


tyr 


ala 


ile 


thr 


gly pro 
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ser 


giy 


arg 


met 
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301/101 


















331/111 


















ggc gga 


teg 


get 


get 


ggg 


get 


teg 


cgc 


cag 


cgt cgt 


acc 


aag 


ggg 


gee 


ggg 


ggc 


tea 


ttc 


gly gly 


ser 


ala 


ala 


gly 


ala 


ser 


arg 


gin 


arg arg 


thr 
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giy 


ala 


gly 


gly 


ser 


phe 


361/121 


















391/131 


















acc age 


cgt 


atg 


gaa 


gat 


egg 


ttc 


egg 


cgc 
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gag 


taa 












thr ser 
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met 


glu 


asp 


arg 


phe 


arg 


arg 


arg phe 


asp 
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SEQ ID JSTOSs 807-808 

FIG. 45D 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv2169c 
1^1 ^SEQ ID MO: 809 31/11 

tga cag tct gtc ggc aag gag gga cgc atg cca etc tec gat cat gag cag egg atg ctt 

OPA(gln ser val gly lys glu gly arg met pro leu ser asp his glu gin arg met leu 
61/2 1^ SEQ ID NO: 810 91/31 

gac cag ate gag age get etc tac gee gaa gat ccc aag ttc gca teg agt gtc cgt ggc 

asp gin ile glu ser ala leu tyr ala glu asp pro lys phe ala ser ser val arg gly 
121/41 151/51 

ggg ggc ttc cgc gca ccg acc gcg egg egg cgc ctg cag ggc gcg gcg ttg ttc ate ate 

gly gly phe arg ala pro thr ala arg arg arg leu gin gly ala ala leu phe ile ile 
181/61 211/71 

ggt ctg ggg atg ttg gtt tec ggc gtg gcg ttc aaa gag acc atg ate gga agt ttc ccg 

gly leu gly met leu val ser gly val ala phe lys glu thr met ile gly ser phe pro 
241/81 271/91 

ata etc age gtt ttc ggt ttt gtc gtg atg ttc ggt ggt gtg gtg tat gee ate acc ggt 

ile leu ser val phe gly phe val val met phe gly gly val val tyr ala ile thr gly 
301/101 331/111 

cct egg ttg tec ggc agg atg gat cgt ggc gga teg get get ggg get teg cgc cag cgt 

pro arg leu ser gly arg met asp arg gly gly ser ala ala gly ala ser arg gin arg 
361/121 391/131 

cgt acc aag ggg gee ggg ggc tea ttc acc age cgt atg gaa gat egg ttc egg cgc. cgc 

arg thr lys gly ala gly gly ser phe thr ser arg met glu asp arg phe arg arg arg 
421/141 

ttc gac gag taa 
phe asp glu)0CH 

SEQ ID Nos 809-810 



FIG. 45E 



n . SEO ID MO- 



1/1 ^SEQ ID NO: 


811 










31/11 
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SEQ 


ID NO: 812 
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211/71 
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241/81 


















271/91 
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pro asp 


ala 


lys 


pro 


leu 


val 


glu 


glu 


ala 


301/101 


















331/111 


















acc gcg 


cag 


acc 


aag 


get 


etc 


aag 


age 


gcg 


cac atg 


gtg 


ctg 


acg 


gtc 


aac 


ggc 


aag 


ate 


thr ala 


gin 


thr 


lys 


ala 


leu 


lys 


ser 


ala 


his met 


val 


leu 


thr 


val 


asn 


gly 


lys 


ile) 



SEQ ID NOSs 811-812 

FIGc 46A 



1/1* SEQ ID 


NO: 


813 












31/11 


















age cgc 


gcc 


gca 


teg 


acc 


agg 


gcc 


tea 


cgc 


ccg gtc 


act 


tct 


ccg 


cgt 


tec 


tea 


aca 


att 


; ser arg 


ala 


ala 


ser 


thr 


arg 


ala 


ser 


arg 


pro val 


thr 


ser 


pro 


arg 


ser 


ser 


thr 


ile 


61/21"" ■ 


SEQ 


ID NO: 814 










91/31 


















ccg gtg 


aac 


ate 


gca 


cca 


ggt 


tag 


gca 


gca 


ate ccg 


egg 


acc 


cgc 


acc 


cca 


etc 


gcc 


gac 


pro val 


asn 


ile 


ala 


pro 


gly)AMB(ala 


ala 


ile pro 


arg 


thr 


arg 


thr 


pro 


leu 


ala 


asp 


121/41 






SEQ 


ID NO: 815 — 






151/51 


















egg cca 


act 


cac 


aga 


cac 


cct 


eta 


cga 


tgc 


agg gta 


tgc 


gga 


ccc 


cca 


gac 


gcc 


act 


gcc 


arg pro 


thr 


his 


arg 


his 


pro 


leu 


arg 


cys 


arg val 


cys 


gly 


pro 


pro 


asp 


ala 


thr 


ala 


181/61 


















211/71 


















gtc gca 


teg 


ccg 


tec 


teg 


ccg 


ccg 


tta 


gca 


teg ccg 


cca 


ctg 


teg 


ttg 


ccg 


get 


get 


cgt 


val ala 


ser 


pro 


ser 


ser 


pro 


pro 


leu 


ala 


ser pro 


pro 


leu 


ser 


leu 


pro 


ala 


ala 


arg 


241/81 


















271/91 


















egg get 


cga 


age 


caa 


gcg 


gcg 


gac 


cac 


ttc 


egg acg 


cga 


age 


cgc 


tgg 


teg 


agg 


agg 


cca 


arg ala 


arg 


ser 


gin 


ala 


ala 


asp 


his 


phe 


arg thr 


arg 


ser 


arg 


trp 


ser 


arg 


arg 


pro 


301/101 


















331/111 


















ccg cgc 


aga 


cca 


agg 


etc 


tea 


aga 


gcg 


cgc 


aca tgg 


tgc 


tga 


egg 


tea 


acg 


gca 


aga 


tc 


pro arg 


arg 


pro 


arg 


leu 


ser 


arg 


ala 


arg 


thr trp 


cys 


)OPA(arg 


ser 


thr 


ala 


arg) 





4— SEQ ID NO: 816 



SEQ ZD NOSs 813-816 

FIGo 4©B 




1/1 SEQ ID 


NO: 


817 












"3 1 /IT 


















gcc gcg 


ccg 


cat 


cga 


cca 


ggg 


cct 


cac 


gcc 


egg tea 


ct t 


etc 


cgc 


gtt 


cct 


caa 


caa 


ccc 


[ala ala 


pro 


his 


arg 


pro 


gly 


pro 


hi s 


axa 


arg ser 


leu 


leu 


arg 


vax 


pro 


gin 


gin 


phe 


61/21^ 


SEQ 


ID NO: 818 










Q 1 / "3 1 

y x / 5 x 


















egg tga 


aca 


teg 


cac 


cag 


gtt 


agg 


cag 


caa 


tec cgc 


gga 


ccc 


gca 


ccc 


cac 


teg 


ccg 


acc 


arg)OPA(thr 


ser 


his 


gin 


val 


arg 


gin 


gin 


ser arg 


gly 


pro 


axa 


pro 


his 


ser 


pro 




121/41 


— SEQ ID NO: 819 








1 C 1 /CI 


















ggc caa 


etc 


aca 


gac 


ace 


etc 


tac 


gat 


gca 


ggg tat 


gcg 


gac 


ccc 


cag 


acg 


cca 


ctg 


ccg 


gly gin 


leu 


thr 


asp 


thr 


leu 


tyr 


asp 


ala 


gly tyr 


axa 


asp 


pro 


gxn 


tnr 


pro 


leu 


pro 


181/61 


















Z XX / /X 


















teg cat 


cgc 


cgt 


cct 


cgc 


cgc 


cgt 


tag 


cat 


cgc cgc 


cac 


tgt 


cgt 


tgc 


egg 


ctg 


etc 


gtc 


ser his 


arg 


arg 


pro 


arg 


arg 


arg) AMB (his 


arg arg 


his 


cys 


arg 


cys 


arg 


leu 


leu 


val 


241/81 








SEQ 


ID NO: i 


B20 — 


* 


271/91 


















ggg etc 


gaa 


gee 


aag 


egg 


egg 


acc 


act 


tec 


gga cgc 


gaa 


gcc 


get 


ggt 


cga 


gga 


ggc 


cac 


gly leu 


glu 


ala 


lys 


arg 


arg 


thr 


thr 


ser 


gly arg 


glu 


ala 


ala 


gly 


arg 


gly 


gly 


his 


301/101 


















331/111 


















cgc gca 


gac 


caa 


ggc 


tct 


caa 


gag 


cgc 


gca 


cat ggt 


get 


gac 


ggt 


caa 


egg 


caa 


gat 


c 


arg ala 


asp 


gin 


gly 


ser 


gin 


glu 


arg 


ala 


his gly 


ala 


asp 


gly 


gin 


arg 


gin 


asp) 





SEQ ID NOSs 817-820 



FIG. 46C 




Coding sequence Rvl411c predicted by Cole et al, 
and containing seq46A: 



1998 (Nature 393: 537-544) 



1 /1 ^ SEQ ID NO: 821 31/11 

atg egg acc ccc aga cgc cac tgc cgt cgc ate gec gtc etc 
( Met a rg thr pro arg arg his cys arg arg ile ala val leu 
61/21^- — SEQ ID NO: 822 91/31 
gec act gtc gtt gec ggc tgc teg teg ggc teg aag cca age 
ala thr val val ala gly cys ser ser gly ser lys pro ser 
121/41 151/51 
gcg aag ccg ctg gtc gag gag gee acc gcg cag acc aag get 
ala lys pro leu val glu glu ala thr ala gin thr lys ala 
181/61 211/71 
gtg ctg acg gtc aac ggc aag ate ccg gga ctg tct ctg aag 
val leu thr val asn gly lys ile pro gly leu ser leu lys 
241/81 271/91 
acc acc aac ccc acc gec gcg acg gga aac gtc aag etc acg 
thr thr asn pro thr ala ala thr gly asn val lys leu thr 
301/101 331/111 
gat gee gac ttc gtg gtg ttc gac ggg ate ctg tac gec acc 
asp ala asp phe val val phe asp gly ile leu tyr ala thr 
361/121 391/131 
age gat ttc ggt ccc gee gee gac ate tac gac ccc gec cag 
ser asp phe gly pro ala ala asp ile tyr asp pro ala gin 
421/141 451/151 
ggc ctg gee aac gtg ctg gcg aat ttc gee gac gca aaa gec 
gly leu ala asn val leu ala asn phe ala asp ala lys ala 
481/161 511/171 
aac ggc cag aac acc ate cgc ate age ggg aag gta teg gca 
asn gly gin asn thr ile arg ile ser gly lys val ser ala 
541/181 571/191 
gcg ccg ccg ttc aac gcg acg cag ccg gtg ccg gcg acc gtc 
ala pro pro phe asn ala thr gin pro val pro ala thr val 
601/201 631/211 
gat cat caa ctg gca cag gee cag ttg gac cgc ggc teg ggc 
asp his gin leu ala gin ala gin leu asp arg gly ser gly 
661/221 691/231 
ttg teg aaa tgg ggc gag aag gtc cag gtc acg aag ccc ccg 
leu ser lys trp gly glu lys val gin val thr lys pro pro 



gee 
ala 

ggc 

gly 

etc 
leu 

acg 
thr 

ctg 
leu 

ctg 
leu 

gtg 
val 

gaa 
glu 

cag 
gin 

tgg 
trp 

aat 
asn 

gtg 

val 



gee gtt 
ala val 

gga cca 
gly pro 

aag age 
lys ser 

ctg age 
leu ser 

ggt ggg 
gly gly 

acg ccc 
thr pro 

ctg aat 
leu asn 

ggg egg 
gly arg 

gcg gtg 
ala val 

att cag 
ile gin 

tec gtc 
ser val 

age tga 
ser)OPA 



age ate gee 
ser ile ala 

ctt ccg gac 
leu pro asp 

gcg cac atg 
ala his met 

ggc gat etc 
gly asp leu 

tct gat ate 
ser asp ile 

aac cag tgg 
asn gin trp 

ccg gat acc 
pro asp thr 

gat acc ate 
asp thr ile 

aac cag ata 
asn gin ile 

gag acc ggc 
glu thr gly 

cag atg acc 
gin met thr 



SEQ ID MOSs 821-822 



FIGo 46D 




ORF according to Cole et al . , 1998 (Nature 393: 537-544): 
and containing the coding sequence Rvl411c: 



1/1___^SEQ ID NO: 823 

tag etc acc cag gtt gga ccg 

AMB (leu thr gin val gly pro 

61/21^SEQ ID NO: 824 

gtc ggg caa tac ate gac gac 

val gly gin tyr ile asp asp 

121/41 

gec ggc gtc ggc ggt aac cat 
ala gly val gly gly asn his 
181/61 

etc acg ccc ggt cac ttc tec 
leu thr pro gly his phe ser 
241/81 

ggc age aat ccc gcg gac ccg 
gly ser asn pro ala asp pro 
301/101 

acg atg cag ggt atg egg acc 
thr met gin gly met arg thr 
361/121 

gtt age ate gee gee act gtc 
val ser ile ala ala thr val 
421/141 

cca ctt ccg gac gcg aag ccg 
pro leu pro asp ala lys pro 
481/161 

age gcg cac atg gtg ctg acg 
ser ala his met val leu thr 
541/181 

age ggc gat etc acc acc aac 
ser gly asp leu thr thr asn 
601/201 

ggg tct gat ate gat gee gac 
gly ser asp ile asp ala asp 
661/221 

ccc aac cag tgg age gat ttc 
pro asn gin trp ser asp phe 
721/241 

aat ccg gat acc ggc ctg gee 
asn pro asp thr gly leu ala 
781/261 

egg gat acc ate aac ggc cag 
arg asp thr ile asn gly gin 
841/281 

gtg aac cag ata gcg ccg ccg 
val asn gin ile ala pro pro 
901/301 

cag gag acc ggc gat cat caa 
gin glu thr gly asp his gin 
961/321 

gtc cag atg acc ttg teg aaa 

val gin met thr leu ser lys 

1021/341 

tga 

OPA 



31/11 





cag 




t_.i_.t-. y y i_ 


4- 


Vci -L 


gin 


cys 


1 All /T 1 

X fcr H y iy 


hi s 








Q 1 / ^ 1 




eg t 


cag 


aca 


v— cl y <^ 


gtt 


arg 


gin 


tnr 


his ala 


Vdx 








1 CI /CI 

X D X / D X 




egg 


ac c 


gcg 


gat ggt 


cag 


arg 


enr 


a i.a 


asp gly 


gin 








^11/ / X 




gcg 


4- 4- r+ 


etc 


aac aat 


tec 


ala 


pne 


leu 


asn asn 


ser 








z /!/ yi 




cac 


ccc 


ac t 


cgc cga 


ccg 


ni s 


pro 


4- Vi -v- 

tnr 


arg arg 


pro 








ill / 1 1 1 

D .3 X / X X X 




ccc 


aga 


cgc 


cac tgc 


cgt 


pro 


arg 


arg 


his cys 


arg 














gec 


ggc 


tgc teg 


teg 


val 


axa 


gly 


cys ser 


ser 








/IC1 /I CI 




ctg 


gtc 


gag 


gag gec 


acc 


leu 


val 


glu 


glu ala 


1-"h-r 








D X X / X / X 




gtc 


aac 


ggc 


aag ate 


ccg 


va jl 


asn 


gly 


lys ile 


pro 








S71 /1 Q1 

~) 1 X / X _7 X 




ccc 


ac c 


gec 


r~r r~* r~x /*t 

yoy at_y 


gg a 


pro 


unr 


ala 


glXcl uxii 


gly 








631/211 




L- L- v.. 


gtg 


gtg 


+- t~ f~> fT=3 C 
L- L- v_- yav 


yyy 


phe 


val 


val 




(~f "I "V 








V J X / <&■ J X 






ccc 


y 


y y a» 


ate 


gly 


pro 


d -L d 


ala asp 


x xts 








7 51/251 




aac 


y ^y 


ctg 


^ t~ 

y y oa i_ 


ttc 


asn 


va 1 


leu 


al a =i cz*n 


phe 








811/271 




aac 


acc 


ate 


cgc ate 


age 


asn 


thr 


ile 


arg ile 


ser 








871/291 




ttc 


aac 


gcg 


acg cag 


ccg 


phe 


asn 


ala 


thr gin 


pro 








931/311 




ctg 


gca 


cag 


gec cag 


ttg 


leu 


ala 


gin 


ala gin 


leu 








991/331 




tgg 


ggc 


gag 


aag gtc 


cag 


trp 


gly 


glu 


lys val 


gin 



cac gtc ggc ggt gaa ttg gee 
his val gly gly glu leu ala 

gac age gat cga gtc gec gtg 
asp ser asp arg val ala val 

ccg cgc cgc ate gac cag ggc 
pro arg arg ile asp gin gly 

ggt gaa cat cgc acc agg tta 
gly glu his arg thr arg leu 

gee aac tea cag aca ccc tct 
ala asn ser gin thr pro ser 

cgc ate gec gtc etc gec gec 
arg ile ala val leu ala ala 

ggc teg aag cca age ggc gga 
gly ser lys pro ser gly gly 

gcg cag acc aag get etc aag 
ala gin thr lys ala leu lys 

gga ctg tct ctg aag acg ctg 
gly leu ser leu lys thr leu 

aac gtc aag etc acg ctg ggt 
asn val lys leu thr leu gly 

ate ctg tac gec acc ctg acg 
ile leu tyr ala thr leu thr 

tac gac ccc gec cag gtg ctg 
tyr asp pro ala gin val leu 

gec gac gca aaa gec gaa ggg 
ala asp ala lys ala glu gly 

ggg aag gta teg gca cag gcg 
gly lys val ser ala gin ala 

gtg ccg gcg acc gtc tgg att 
val pro ala thr val trp ile 

gac cgc ggc teg ggc aat tec 
asp arg gly ser gly asn ser 

gtc acg aag ccc ccg gtg age 
val thr lys pro pro val ser) 



SEQ ID NOSs 823-824 



FlOo 46E 




1/1 SEQ ID NO: 825 



gag ctg gtc aac ggc gcc ggc ate gac 

(gluleu val asn gly ala gly ile asp 

6l72T~^SEQ ID NO: 826 

ctg gcc gat gcc cag cag atg gtc gag 

leu ala asp ala gin gin met val glu 
121/41 

gtg ttg gtg gcc teg ggc age aac cat 

val leu val ala ser gly ser asn his 
181/61 

ttc gac get gtg atg gac gcg aac gtg 

phe asp ala val met asp ala asn val 
241/81 

egg gtg ctg etc gag cag ggt cag ggc 

arg val leu leu glu gin gly gin gly 
301/101 

ggg ttg ggc aat gcc gcc ggt tac age 

gly leu gly asn ala ala gly tyr ser 



31/11 

gac gcc gcc gtc gtg acc tgc egg ccg gac age 
asp ala ala val val thr cys arg pro asp ser 
91/31 

gcg gca ctg ggc cga tat ggc cgt ttg gac gga 
ala ala leu gly arg tyr gly arg leu asp gly 
151/51 

gtg gcg ccc att acc gag atg gcc gtc gag gac 
val ala pro ile thr glu met ala val glu asp 
211/71 

egg ggt gcc tgg ctg gtg tgt egg gcg gcc gga 
arg gly ala trp leu val cys arg ala ala gly 
271/91 

ggc age gtg gtg ctg gtg teg tec gtt cgc ggc 
gly ser val val leu val ser ser val arg gly 
331/111 

gcg tac tgc ccg teg aag gcg ggc acc gat c 
ala tyr cys pro ser lys ala gly thr asp) 



SEQ ID NOSs 825-826 



l/l_^ SEQ ID NO: 


» 827 








age tgg 


tea 


acg 


gcg 


ccg 


gca 


teg 


acg 


(ser trp 


ser 


thr 


ala 


pro 


ala 


ser 


thr 


61/21^- 


SEQ 


ID NO: 828 








tgg ccg 


atg 


ccc 


age 


aga 


tgg 


teg 


agg 


trp pro 


met 


pro 


ser 


arg 


trp 


ser 


arg 


121/41 
















tgt tgg 


tgg 


cct 


egg 


gca 


gca 


acc 


atg 


cys trp 


trp 


pro 


arg 


ala 


ala 


thr 


met 


181/61 
















teg acg 


ctg 


tga 


tgg 


acg 


cga 


acg 


tgc 


ser thr 


leu)OPA(trp 


thr 


arg 


thr 


cys 


241/81 






*L-SEQ ID NO 


: 830 


ggg tgc 


tgc 


teg 


age 


agg 


gtc 


agg 


gcg 


gly cys 


cys 


ser 


ser 


arg 


val 


arg 


ala 


301/101 
















ggt tgg 


gca 


atg 


ccg 


ccg 


gtt 


aca 


gcg 


gly trp 


ala 


met 


pro 


pro 


val 


thr 


ala 



31/11 

acg ccg ccg teg tga cct gcc ggc egg aca gcc 
thr pro pro ser)OPA(pro ala gly arg thr ala 

91/31 C^SEQ ID NO: 829 

egg cac tgg gcc gat atg gcc gtt tgg acg gag 
arg his trp ala asp met ala val trp thr glu 

151/51 

tgg cgc cca tta ccg aga tgg ccg teg agg act 
trp arg pro leu pro arg trp pro ser arg thr 
211/71 

ggg gtg cct ggc tgg tgt gtc ggg egg ccg gac 
gly val pro gly trp cys val gly arg pro asp 
271/91 

gca gcg tgg tgc tgg tgt cgt ccg ttc gcg gcg 
ala ala trp cys trp cys arg pro phe ala ala 
331/111 

cgt act gcc cgt cga agg egg gca ccg ate 
arg thr ala arg arg arg arg ala pro ile) 



SEQ ID NOSs 827-830 



V3o 4lD 



MAR 2 8 2283 8 



1/1. SEQ ID 


NO: 


831 












31/11 


















get yyt 


caa 


egg 


cgc 


egg 


cat 


cga 


cga 


cgc 


cgc cgt 


cgt 


gac 


ctg 


ccg 


gee 


gga 


cag 


cct 


V ci -Lcl yly 


gin 


arg 


arg 


arg 


his 


arg 


arg 


arg 


arg arg 


arg 


asp 


leu 


pro 


ala 


crlv 


gin 


pro 




• SEQ ID NO: 832 








91/31 


















yy<- v-yd 


tgc 


cca 


gca 


gat 


ggt 




yy 1 - 


y y ( - 


u. yyy 


cca 
<-t-y 


at a 


t" CI CI 


ccg 


ttt 




carer 


acrt 


gly arg 


cys 


pro 


ala 


asp 


gly 


arg 


y x 


y x 


thr gly 


pro 


ile 




pro 


phe 


gly 


arg 


ser 


121/41 


















151/51 


















gtt ggt 


ggc 


etc 


ggg 


cag 


caa 


cca 


tgt 


ggc 


gec cat 


tac 


cga 


gat 


ggc 


cgt 


cga 


gga 


ctt 


val gly 


gly 


leu 


gly 


gin 


gin 


pro 


cys 


gly 


ala his 


tyr 


arg 


asp 


gly 


arg 


arg 


gly 


leu 


181/61 


















211/71 


















cga cgc 


tgt 


gat 


gga 


cgc 


gaa 


cgt 


gcg 


ggg 


tgc ctg 


get 


ggt 


gtg 


teg 


ggc 


ggc 


egg 


acg 


arg arg 


cys 


asp 


gly 


arg 


glu 


arg 


ala 


gly 


cys leu 


ala 


gly 


val 


ser 


gly 


gly 


arg 


thr 


241/81 


















271/91 


















ggt get 


get 


cga 


gca 


ggg 


tea 


ggg 


egg 


cag 


cgt ggt 


get 


ggt 


gtc 


gtc 


cgt 


teg 


egg 


egg 


gly ala 


ala 


arg 


ala 


gly 


ser 


gly 


arg 


gin 


arg gly 


ala 


gly 


val 


val 


arg 


ser 


arg 


arg 


301/101 


















331/111 


















gtt ggg 


caa 


tgc 


cgc 


egg 


tta 


cag 


cgc 


gta 


ctg ccc 


gtc 


gaa 


ggc 


ggg 


cac 


cga 


tc 




val gly 


gin 


cys 


arg 


arg 


leu 


gin 


arg 


val 


leu pro 


val 


glu 


gly 


gly 


his 


arg) 





SEQ ID NOSs 831-832 



Coding sequence Rvl714 predicted by Cole et al., 1998 (Nature 393: 537-544) 
and containing seq 47A: 



1/1 SEQ ID NO: 833 

gtg gag gaa atg gcg ctg 
( val g lu glu met ala leu 
61/21 " — SEQ ID NO: 834 
cag gac aag teg ate ctg 
gin asp lys ser ile leu 
121/41 

gcg ctg gec gac gcg gga 
ala leu ala asp ala gly 
181/61 

gag ctg gtc aac ggc gec 
glu leu val asn gly ala 
241/81 

ctg gec gat gec cag cag 
leu ala asp ala gin gin 
301/101 

gtg ttg gtg gee teg ggc 
val leu val ala ser gly 
361/121 

ttc gac get gtg atg gac 
phe asp ala val met asp 
421/141 

egg gtg ctg etc gag cag 
arg val leu leu glu gin 
481/161 

ggg ttg ggc aat gee gee 
gly leu gly asn ala ala 
541/181 

ttg gee aag aca ttg gcg 
leu ala lys thr leu ala 
601/201 

ccg acg gtg ttt egg tec 
pro thr val phe arg ser 
661/221 

gee ace egg gag gcg atg 
ala thr arg glu ala met 
721/241 

ttc gtc ggc gee ctg ate 
phe val gly ala leu ile 
781/261 

atg tat ctg gac ggc ggg 
met tyr leu asp gly gly 



31/11 

get cag cag gtg ccg aac ctg 
ala gin gin val pro asn leu 
91/31 

ate acc ggc gcg ace ggt teg 
ile thr gly ala thr gly ser 
151/51 

gcg egg ctg aca ctg gee ggc 
ala arg leu thr leu ala gly 
211/71 

ggc ate gac gac gee gee gtc 
gly ile asp asp ala ala val 
271/91 

atg gtc gag gcg gca ctg ggc 
met val glu ala ala leu gly 
331/111 

age aac cat gtg gcg ccc att 
ser asn his val ala pro ile 
391/131 

gcg aac gtg egg ggt gec tgg 
ala asn val arg gly ala trp 
451/151 

ggt cag ggc ggc age gtg gtg 
gly gin gly gly ser val val 
511/171 

ggt tac age gcg tac tgc ccg 
gly tyr ser ala tyr cys pro 
571/191 

gec gaa tgg ggc ggt cac ggc 
ala glu trp gly gly his gly 
631/211 

gcg gtg acc gag tgg atg ttc 
ala val thr glu trp met phe 
691/231 

etc gee egg ate ccg ttg cgc 
leu ala arg ile pro leu arg 

751/251 
tat ctg etc age gac gee teg 
tyr leu leu ser asp ala ser 

811/271 
tac acc gca tgc tga 
tyr thr ala cys)OPA 



SEQ ID NOS 2 833-834 



ggt ctg gcg cgc ttc age gtg 
gly leu ala arg phe ser val 

ttg ggc cga gtt gec gee egg 
leu gly arg val ala ala arg 

ggc aac teg gec ggt ctg gee 
gly asn ser ala gly leu ala 

gtg acc tgc egg ccg gac age 
val thr cys arg pro asp ser 

cga tat ggc cgt ttg gac gga 
arg tyr gly arg leu asp gly 

acc gag atg gee gtc gag gac 
thr glu met ala val glu asp 

ctg gtg tgt egg gcg gee gga 
leu val cys arg ala ala gly 

ctg gtg teg tec gtt cgc ggc 
leu val ser ser val arg gly 

teg aag gcg ggc acc gat ctg 
ser lys ala gly thr asp leu 

att egg gtg aac gcg ctg gcg 
ile arg val asn ala leu ala 

acc gac gat ccg aag ggc egg 
thr asp asp pro lys gly arg 

cgc ttc gec gaa ccg gaa gac 
arg phe ala glu pro glu asp 

age ttc tac acc ggc cag gtg 
ser phe tyr thr gly gin val 



FIG, 47D 




ORF according to Cole et al . , 1998 {Nature 393: 537-544) and containing 
the coding sequence Rvl714 : 

2 4/1 ^ SEQ ID NO: 835 54/11 

tag gtg gag gaa atg gcg ctg get cag cag gtg ccg aac ctg ggt ctg gcg cgc ttc age 
AMB (val glu glu met ala leu ala gin gin val pro asn leu gly leu ala arg phe ser 
84/21^ SEQ ID NO: 836 114/31 

gtg cag gac aag teg ate ctg ate ace ggc gcg acc ggt teg ttg ggc cga gtt gee gee 
val gin asp lys ser ile leu ile thr gly ala thr gly ser leu gly arg val ala ala 
144/41 174/51 

egg gcg ctg gee gac gcg gga gcg egg ctg aca ctg gec ggc ggc aac teg gee ggt ctg 
arg ala leu ala asp ala gly ala arg leu thr leu ala gly gly asn ser ala gly leu 
204/61 234/71 

gec gag ctg gtc aac ggc gec ggc ate gac gac gee gec gtc gtg acc tgc egg ccg gac 
ala glu leu val asn gly ala gly ile asp asp ala ala val val thr cys arg pro asp 
264/81 294/91 

age ctg gee gat gec cag cag atg gtc gag gcg gca ctg ggc cga tat ggc cgt ttg gac 
ser leu ala asp ala gin gin met val glu ala ala leu gly arg tyr gly arg leu asp 
324/101 354/111 

gga gtg ttg gtg gee teg ggc age aac cat gtg gcg ccc att acc gag atg gec gtc gag 
gly val leu val ala ser gly ser asn his val ala pro ile thr glu met ala val glu 
384/121 414/131 

gac ttc gac get gtg atg gac gcg aac gtg egg ggt gee tgg ctg gtg tgt egg gcg gec 
asp phe asp ala val met asp ala asn val arg gly ala trp leu val cys arg ala ala 
444/141 474/151 

gga egg gtg ctg etc gag cag ggt cag ggc ggc age gtg gtg ctg gtg teg tec gtt cgc 
gly arg val leu leu glu gin gly gin gly gly ser val val leu val ser ser val arg 
504/161 534/171 

ggc ggg ttg ggc aat gee gee ggt tac age gcg tac tgc ccg teg aag gcg ggc acc gat 
gly gly leu gly asn ala ala gly tyr ser ala tyr cys pro ser lys ala gly thr asp 
564/181 594/191 

ctg ttg gee aag aca ttg gcg gec gaa tgg ggc ggt cac ggc att egg gtg aac gcg ctg 
leu leu ala lys thr leu ala ala glu trp gly gly his gly ile arg val asn ala leu 
624/201 654/211 

gcg ccg acg gtg ttt egg tec gcg gtg acc gag tgg atg ttc acc gac gat ccg aag ggc 
ala pro thr val phe arg ser ala val thr glu trp met phe thr asp asp pro lys gly 
684/221 714/231 

egg gec acc egg gag gcg atg etc gee egg ate ccg ttg cgc cgc ttc gee gaa ccg gaa 
arg ala thr arg glu ala met leu ala arg ile pro leu arg arg phe ala glu pro glu 
744/241 774/251 

gac ttc gtc ggc gec ctg ate tat ctg etc age gac gee teg age ttc tac acc ggc cag 
asp phe val gly ala leu ile tyr leu leu ser asp ala ser ser phe tyr thr gly gin 
804/261 834/271 
gtg atg tat ctg gac ggc ggg tac acc gca tgc tga 
val met tyr leu asp gly gly tyr thr ala cys)OPA 



SEQ ID NOSs 835-836 



FIGo 4TE 




1/1 _ ^SEQ ID NO: 837 31/11 

agg etc atg age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg acc acc gec 

( arg le u met ser lys thr val leu ile leu gly ala gly val gly gly leu thr thr ala 

61/ 21^- SEQ ID NO: 838 91/31 

gac acc etc cgt caa ctg eta cca cct gag gat c 

asp thr leu arg gin leu leu pro pro glu asp) 



SEQ ID NOSs 837-838 

FIG. 48A 



1/ 1 ^ SEQ ID NO: 839 31/11 

ggc tea tga gca aga egg ttc tea tec ttg gcg egg gtg teg gcg gec tga cca ccg ccg 
gly ser OPA(ala arg arg phe ser ser leu ala arg val ser ala ala)OPA(pro pro pro 

61/21 C-SEQ ID NO: 840 91/31 SEQ ID NO: 841 * 

aca ccc tec gtc aac tgc tac cac ctg agg ate 
thr pro ser val asn cys tyr his leu arg ile) 



SEQ ID NOS: 839-841 

FIG. 48B 



1/1 ^ SEQ ID NO: 842 31/11 

get cat gag caa gac ggt tct cat cct tgg cgc ggg tgt egg egg cct gac cac cgc cga 
( ala hi s glu gin asp gly ser his pro trp arg gly cys arg arg pro asp his arg arg 
61/21"^ SEQ ID NO: 843 

cac cct ccg tea act get acc acc tga gga tc 
his pro pro ser thr ala thr thr)OPA gly 



SEQ ID NOS: 842-843 



FIG. 48C 



26 W33 % 



Coding sequence Rv0331 predicted by Cole et al . , 1998 (Nature 393: 
537-544) and containing seq48A: 

1^_SEQ ID NO: 844 31/11 

atg age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg ace acc gee gac acc 
( Met se r lys thr val leu ile leu gly ala gly val gly gly leu thr thr ala asp thr 
61/21**" SEQ ID NO: 845 91/31 

etc cgt caa ctg eta cca cct gag gat cga ate ata ttg gtg gac agg age ttt gac ggg 
leu arg gin leu leu pro pro glu asp arg ile ile leu val asp arg ser phe asp gly 
121/41 151/51 

acg ctg ggc ttg teg ttg eta tgg gtg ttg egg ggc tgg egg egg cct gac gac gtc cgc 
thr leu gly leu ser leu leu trp val leu arg gly trp arg arg pro asp asp val arg 
181/61 211/71 

gtc cgc ccc acc gcg gcg teg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gee cac 
val arg pro thr ala ala ser leu pro gly val glu met val thr ala thr val ala his 
241/81 271/91 

att gac ate gcg gec cag gta gtg cac acc gac aac age gtc ate ggc tat gac gcg ttg 
ile asp ile ala ala gin val val his thr asp asn ser val ile gly tyr asp ala leu 
301/101 331/111 

gtg ate gca tta ggt gcg gcg ctg aac acc gac gee gtt ccc gga ctg teg gac gcg etc 
val ile ala leu gly ala ala leu asn thr asp ala val pro gly leu ser asp ala leu 
361/121 391/131 

gac gec gac gtc gcg ggc cag ttc tac acc ctg gac ggc gcg get gag ctg cgt gcg aag 
asp ala asp val ala gly gin phe tyr thr leu asp gly ala ala glu leu arg ala lys 
421/141 451/151 

gtc gag gcg etc gag cat ggc egg ate get gtg get ate gec ggg gtg ccg ttc aaa tgc 
val glu ala leu glu his gly arg ile ala val ala ile ala gly val pro phe lys cys 
481/161 511/171 

cca gec gca ccg ttc gaa gcg gcg ttt ctg ate gec gec caa etc ggt gac cgc tac gee 
pro ala ala pro phe glu ala ala phe leu ile ala ala gin leu gly asp arg tyr ala 
541/181 571/191 

acc gga acc gta cag ate gac acg ttc acg cct gac ccg ctg ccg atg ccc gtt gca ggt 
thr gly thr val gin ile asp thr phe thr pro asp pro leu pro met pro val ala gly 
601/201 631/211 

ccc gag gtc ggc gag get ttg gtc teg atg etc aag gat cac ggt gtc ggc ttc cat cct 
pro glu val gly glu ala leu val ser met leu lys asp his gly val gly phe his pro 
661/221 691/231 

cgc aag gee eta get cgc gtc gat gag gee gca agg acg atg cac ttc ggt gac ggc acg 
arg lys ala leu ala arg val asp glu ala ala arg thr met his phe gly asp gly thr 
721/241 751/251 

tec gaa ccg ttc gat ctg ctt gee gtg gtc ccc ccg cac gtg ccc tec gec gcg gcg egg 
ser glu pro phe asp leu leu ala val val pro pro his val pro ser ala ala ala arg 
781/261 811/271 

tea gcg ggt etc age gaa tec ggg tgg ata ccc gtg gac ccg cgc acc ctg tec act age 
ser ala gly leu ser glu ser gly trp ile pro val asp pro arg thr leu ser thr ser 
841/281 871/291 

gee gac aac gtg tgg gec ate ggc gat gcg acc gtg ctg acg ctg ccg aat ggc aaa ccg 
ala asp asn val trp ala ile gly asp ala thr val leu thr leu pro asn gly lys pro 
901/301 931/311 

ctg ccc aag get gec gtg ttc gee gaa gee cag gee gca gtt gtc gee cac ggc gtc gee 
leu pro lys ala ala val phe ala glu ala gin ala ala val val ala his gly val ala 
961/321 991/331 

cgc cat etc ggt tac gac gta get gag cgc cac ttc acc ggc acg ggc gec tgc tac gtc 
arg his leu gly tyr asp val ala glu arg his phe thr gly thr gly ala cys tyr val 
1021/341 1051/351 

gag acc ggt gat cac cag gca gec aag ggc gac ggc gat ttc ttc get ccg teg gcg ccc 
glu thr gly asp his gin ala ala lys gly asp gly asp phe phe ala pro ser ala pro 
1081/361 1111/371 

teg gtg acg ctg tac ccg ccg teg egg gag ttt cac gag gag aag gtc gca caa gaa ctg 
ser val thr leu tyr pro pro ser arg glu phe his glu glu lys val ala gin glu leu 
1141/381 

gee tgg ctg acc cgc tgg aag acg tga 
ala trp leu thr arg trp lys thr)OPA 

SEQ ID NOSs 844-845 



FIG. 48D 



ORF according to Cole et al . , 1998 (Nature 393: 537-544) and containing 
coding sequence Rv0331: 



1/l^SEQ ID NO: 846 31/11 

tga aca ccc gcg ccg acg egg cga caa teg egg aaa acc ggt ccg egg gaa tgc tgc ggg 
OPA(thr pro ala pro thr arg arg gin ser arg lys thr gly pro arg glu cys cys gly 
61/21*^SEQ ID NO: 847 91/31 

cca tgg gee gat aat agt ttg act gac teg gtc agt cac ccc aag acc ttg cgc aag act 
pro trp ala asp asn ser leu thr asp ser val ser his pro lys thr leu arg lys thr 
121/41 151/51 

gcg gcg gaa tct aat att cca aag ata tat gga act cga tgc gaa gga ate agg etc atg 
ala ala glu ser asn ile pro lys ile tyr gly thr arg cys glu gly ile arg leu met 
181/61 211/71 

age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg acc acc gee gac acc etc 
ser lys thr val leu ile leu gly ala gly val gly gly leu thr thr ala asp thr leu 
241/81 271/91 

cgt caa ctg eta cca cct gag gat cga ate ata ttg gtg gac agg age ttt gac ggg acg 
arg gin leu leu pro pro glu asp arg ile ile leu val asp arg ser phe asp gly thr 
301/101 331/111 

ctg ggc ttg teg ttg eta tgg gtg ttg egg ggc tgg egg egg cct gac gac gtc cgc gtc 
leu gly leu ser leu leu trp val leu arg gly trp arg arg pro asp asp val arg val 
361/121 391/131 

cgc ccc acc gcg gcg teg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gee cac att 
arg pro thr ala ala ser leu pro gly val glu met val thr ala thr val ala his ile 
421/141 451/151 

gac ate gcg gee cag gta gtg cac acc gac aac age gtc ate ggc tat gac gcg ttg gtg 
asp ile ala ala gin val val his thr asp asn ser val ile gly tyr asp ala leu val 
481/161 511/171 

ate gca tta ggt gcg gcg ctg aac acc gac gee gtt ccc gga ctg teg gac gcg etc gac 
ile ala leu gly ala ala leu asn thr asp ala val pro gly leu ser asp ala leu asp 
541/181 571/191 

gee gac gtc gcg ggc cag ttc tac acc ctg gac ggc gcg get gag ctg cgt gcg aag gtc 
ala asp val ala gly gin phe tyr thr leu asp gly ala ala glu leu arg ala lys val 
601/201 631/211 

gag gcg etc gag cat ggc egg ate get gtg get ate gee ggg gtg ccg ttc aaa tgc cca 
glu ala leu glu his gly arg ile ala val ala ile ala gly val pro phe lys cys pro 
661/221 691/231 

gee gca ccg ttc gaa gcg gcg ttt ctg ate gee gee caa etc ggt gac cgc tac gee acc 
ala ala pro phe glu ala ala phe leu ile ala ala gin leu gly asp arg tyr ala thr 
721/241 751/251 

gga acc gta cag ate gac acg ttc acg cct gac ccg ctg ccg atg ccc gtt gca ggt ccc 
gly thr val gin ile asp thr phe thr pro asp pro leu pro met pro val ala gly pro 
781/261 811/271 

gag gtc ggc gag get ttg gtc teg atg etc aag gat cac ggt gtc ggc ttc cat cct cgc 
glu val gly glu ala leu val ser met leu lys asp his gly val gly phe his pro arg 
841/281 - 871/291 

aag gee eta get cgc gtc gat gag gee gca agg acg atg cac ttc ggt gac ggc acg tec 
lys ala leu ala arg val asp glu ala ala arg thr met his phe gly asp gly thr ser 



SEQ ID NOSs 846-847 



FIG. 48 



-6 \ 



901/301 

gaa ccg ttc gat ctg ctt gcc gtg gtc ccc 
glu pro phe asp leu leu ala val val pro 
961/321 

gcg ggt etc age gaa tec ggg tgg ata ccc 
ala gly leu ser glu ser gly trp ile pro 
1021/341 

gac aac gtg tgg gcc ate ggc gat gcg acc 
asp asn val trp ala ile gly asp ala thr 
1081/361 . 

ccc aag get gcc gtg ttc gcc gaa gcc cag 
pro lys ala ala val phe ala glu ala gin 
1141/381 

cat etc ggt tac gac gta get gag cgc cac 
his leu gly tyr asp val ala glu arg his 
1201/401 

acc ggt gat cac cag gca gcc aag ggc gac 
thr gly asp his gin ala ala lys gly asp 
1261/421 

gtg acg ctg tac ccg ccg teg egg gag ttt 
val thr leu tyr pro pro ser arg glu phe 
1321/441 

tgg ctg acc cgc tgg aag acg tga 
trp leu thr arg trp lys thr)OPA 



931/311 

ccg cac gtg ccc tec gcc gcg gcg egg tea 

pro his val pro ser ala ala ala arg ser 
991/331 

gtg gac ccg cgc acc ctg tec act age gcc 

val asp pro arg thr leu ser thr ser ala 
1051/351 

gtg ctg acg ctg ccg aat ggc aaa ccg ctg 

val leu thr leu pro asn gly lys pro leu 
1111/371 

gcc gca gtt gtc gcc cac ggc gtc gcc cgc 

ala ala val val ala his gly val ala arg 
1171/391 

ttc acc ggc acg ggc gcc tgc tac gtc gag 

phe thr gly thr gly ala cys tyr val glu 
1231/411 

ggc gat ttc ttc get ccg teg gcg ccc teg 

gly asp phe phe ala pro ser ala pro ser 
1291/431 

cac gag gag aag gtc gca caa gaa ctg gcc 

his glu glu lys val ala gin glu leu ala 



SEQ ZD NOSs 846-847 (continued) 

FIG. 48E (continued) 



Fragment amplified by PCR based on the sequence similarities with a 
serine protease of the E.coli htrA family (creation of the BamHI site at 
the 5' end and of the SnaBI site at the 3' end) and subcloned into the 
vector pJVED: 

1/JL_^~SEQ ID NO: 848 31/11 

cca tct aca ccg etc aac age egg gee aga cgc tgc egg teg gtg ctg ccg aga agg egg 
(proser thr pro leu asn ser arg ala arg arg cys arg ser val leu pro arg arg arg) 
6T72T^SEQ ID NO: 849 91/31 

tga tec gtg gcg agt tgt tea tgt cgc ggc gca cca ccg ccg acc aac ggg tgc ttg cca 
OPA(ser val ala ser cys ser cys arg gly ala pro pro pro thr asn gly cys leu pro 
121/*4T^SEQ ID NO: 850 SEQ ID NO: 852\ 151/51 

tec gtc tga cca acg gta gtt cgc tgc tga\tct cca aaa gtc tea age cca ccg aag cag 
ser val)OPA(pro thr val val arg cys)OPA(ser pro lys val ser ser pro pro lys gin 
181/61 ^SEQ ID NO: 851 211/71 

tea tga aca age tgc gtt ggg tgc tat tga teg tgg gtg gga teg ggg tgg egg teg ccg 
ser)OPA(thr ser cys val gly cys tyr)OPA( ser trp val gly ser gly trp arg ser pro 
241/81 ^ — SEQ ID NO: 853 271/91^SEQ ID NO: 854 

egg tgg ccg ggg gga tgg tea ccc ggg ccg ggc tga ggc egg tgg gee gee tea ccg aag 
arg trp pro gly gly trp ser pro gly pro gly)OPA(gly arg trp ala ala ser pro lys 
301/101 331/111 ^^-SEQ ID NO: 855 

egg ccg age ggg tgg cgc gaa ccg acg acc tgc ggc cca tec ccg tct teg gca gcg acg 
arg pro ser gly trp arg glu pro thr thr cys gly pro ser pro ser ser ala ala thr 
361/121 391/131 

aat tgg cca ggc tga cag agg cat tea att taa tgc tgc ggg cgc tgg ccg agt cac ggg 
asn trp pro gly)OPA(gln arg his ser ile) OCH (cys cys gly arg trp pro ser his gly 
421/141 ^SEQ ID NO: 856 451/15T^SEQ ID NO: 857 

aac ggc agg caa ggc tgg tta ccg acg ccg gac atg aat tgc gta ccc cgc taa cgt cgc 
asn gly arg gin gly trp leu pro thr pro asp met asn cys val pro arg) OCH (arg arg 

481/161 511/171 SEQ ID NO: 858 ^ 

tgc gca cca atg teg aac tct tga tgg cct cga tgg ccc egg ggg etc cgc ggc tac cca 
cys ala pro met ser asn ser)OPA(trp pro arg trp pro arg gly leu arg gly tyr pro 
541/181 SEQ ID NO: 859 — 571/191 

age agg aga tgg teg acc tgc gtg ccg atg tgc tgg etc aaa teg agg aat tgt cca cac 
ser arg arg trp ser thr cys val pro met cys trp leu lys ser arg asn cys pro his 
601/201 631/211 

tgg tag gcg att tgg tgg acc tgt ccc gag gcg acg ccg gag aag tgg tgc acg age egg 
trp)AMB(ala ile trp trp thr cys pro glu ala thr pro glu lys trp cys thr ser arg 
661/221 ^'^-SEQ ID NO: 860 691/231 

teg aca tgg ctg acg teg teg acc gca gee tgg age ggg tea ggc ggc ggc gca acg ata . 
ser thr trp leu thr ser ser thr ala ala trp ser gly ser gly gly gly ala thr ile 
721/241 751/251 

tec ttt teg acg teg agg tga ttg ggt ggc agg ttt atg gcg ata ccg ctg gat tgt cgc 
ser phe ser thr ser arg)OPA(leu gly gly arg phe met ala ile pro leu asp cys arg 
781/261 SEQ ID NO: 861-^ 811/271 

gga tgg cgc tta acc tga tgg aca acg ccg cga agt gga gee cgc egg gcg gee acg tgg 
gly trp arg leu thr)OPA(trp thr thr pro arg ser gly ala arg arg ala ala thr trp 
841/281 SEQ ID NO: 862^ 871/291 

gtg tea ggc tga gee age teg acg cgt cgc acg ctg age tgg tgg ttt ccg acc gcg gee 
val ser gly)OPA(ala ser ser thr arg arg thr leu ser trp trp phe pro thr ala ala 
^-SEQ ID NO: 863 



SEQ ID NOSs 848-863 



FIG. 49A 



901/301 

egg gca ttc ccg tgc agg age gec gtc tgg 
arg ala phe pro cys arg ser ala val trp 
961/321 

cac ggg cgt tgc egg gtt egg gee teg ggt 
his gly arg cys arg val arg ala ser gly 
1021/341 

acg gcg gat tgc tgc gca teg aag aca ccg 
thr ala asp cys cys ala ser lys thr pro 
1081/361 

ttt acg tgc tgc tec ccg gec gtc gga tgc 
phe thr cys cys ser pro ala val gly cys 
1141/381 

gcg etc gga gca egg aca teg aga act etc 
ala leu gly ala arg thr ser arg thr leu 
1201/401 

etc agt cca cgc gcg caa cct agt tgt gca 
leu ser pro arg ala gin pro ser cys ala 
1261/421 

cca cgc atg gee aag ttg gee cga gta gtg 
pro arg met ala lys leu ala arg val val 
1321/441 

atg acg aat cac cca egg tat teg cca ccg 
met thr asn his pro arg tyr ser pro pro 
1381/461 

cag ggg cag cag caa acg tac age cag cag 
gin gly gin gin gin thr tyr ser gin gin 
1441/481 

ccg cag cca ace cag tac cgt caa ccc tac 
pro gin pro thr gin tyr arg gin pro tyr 
1501/501 

ata cct ggc gtg att ccg ace atg acg ccc 
ile pro gly val ile pro thr met thr pro 
1561/521 

gca ggc atg ttg gec ate ggc gcg gtg acg 
ala gly met leu ala ile gly ala val thr 
1621/541 

gcg gec gca tec ctg gtc ggg ttc aac egg 
ala ala ala ser leu val gly phe asn arg 
1681/561 

get gee age gcg gcg cca age ate ccc gca 
ala ala ser ala ala pro ser ile pro ala 
1741/581 

gtg gcg gec aag gtg gtg ccc agt gtc gtc 
val ala ala lys val val pro ser val val 
1801/601 

gag gag ggc tec ggc ate att ctg tct gec 
glu glu gly ser gly ile ile leu ser ala 
1861/621 

ate gcg gcg gec gec aag cct ccc ctg ggc 
ile ala ala ala ala lys pro pro leu gly 



931/311 

tgt ttg aac ggt ttt acc ggt egg cat egg 
cys leu asn gly phe thr gly arg his arg 
991/331 

tgg cga teg tea aac agg tgg tgc tea acc 
trp arg ser ser asn arg trp cys ser thr 
1051/351 

acc cag gcg gec age ccc ctg gaa cgt cga 
thr gin ala ala ser pro leu glu arg arg 
1111/371 

cga ttc cgc age ttc ccg gtg cga egg ctg 
arg phe arg ser phe pro val arg arg leu 
1171/391 

ggg gtt egg cga acg tta tct cag tgg aat 
gly val arg arg thr leu ser gin trp asn 
1231/411 

gtt act gtt gaa age cac acc cat gec agt 
val thr val glu ser his thr his ala ser 
1291/431 

ggc eta gta cag gaa gag caa cct age gac 
gly leu val gin glu glu gin pro ser asp 
1351/451 

ccg cag cag ccg gga acc cca ggt tat get 
pro gin gin pro gly thr pro gly tyr ala 
1411/471 

ttc gac tgg cgt tac cca ccg tec ccg ccc 
phe asp trp arg tyr pro pro ser pro pro 
1471/491 

gag gcg ttg ggt ggt acc egg ccg ggt ctg 
glu ala leu gly gly thr arg pro gly leu 
1531/511 

cct cct ggg atg gtt cgc caa cgc cct cgt 
pro pro gly met val arg gin arg pro arg 
1591/531 

ata gcg gtg gtg tec gee ggc ate ggc ggc 
ile ala val val ser ala gly ile gly gly 
1651/551 

gca ccc gec ggc ccc age ggc ggc cca gtg 
ala pro ala gly pro ser gly gly pro val 
1711/571 

gca aac atg ccg ccg ggg teg gtc gaa cag 
ala asn met pro pro gly ser val glu gin 
1771/591 

atg ttg gaa acc gat ctg ggc cgc cag teg 
met leu glu thr asp leu gly arg gin ser 
1831/611 

gag ggg ctg ate ttg acc aac aac cac gtg 
glu gly leu ile leu thr asn asn his val 
1891/631 

agt ccg ccg ccg aaa acg acg gta 
ser pro pro pro lys thr thr val) 



SEQ ID NOSs 848-863 (continued 1) 



FiOo 49 A (continued 1) 




1/1^__-SEQ ID NO: 864 31/11 

cat eta cac cgc tea aca gec ggg cca gac get gee ggt egg tgc tgc cga gaa ggc ggt 
( his l eu his arg ser thr ala gly pro asp ala ala gly arg cys cys arg glu gly gly 
61/ 21" SEQ ID NO: 865 91/31 

gat ccg tgg cga gtt gtt cat gtc gcg gcg cac cac cgc cga cca acg ggt get tgc cat 
asp pro . trp arg val val his val ala ala his his arg arg pro thr gly ala cys his 
121/41 151/51 

ccg tct gac caa egg tag ttc get get gat etc caa aag tct caa gee cac cga age agt 
pro ser asp gin arg)AMB(phe ala ala asp leu gin lys ser gin ala his arg ser ser 
181/61 SEQ ID NO: 866—^ 211/71 

cat gaa caa get gcg ttg ggt get att gat cgt ggg tgg gat egg ggt ggc ggt cgc cgc 
his glu gin ala ala leu gly ala ile asp arg gly trp asp arg gly gly gly arg arg 
241/81 271/91 

ggt ggc egg ggg gat ggt cac ccg ggc egg get gag gee ggt ggg ccg cct cac cga age 
gly gly arg gly asp gly his pro gly arg ala glu ala gly gly pro pro his arg ser 
301/101 331/111 

ggc cga gcg ggt ggc gcg aac cga cga cct gcg gee cat ccc cgt ctt egg cag cga cga 
gly arg ala gly gly ala asn arg arg pro ala ala his pro arg leu arg gin arg arg 
361/121 391/131 

att ggc cag get gac aga ggc att caa ttt aat get gcg ggc get ggc cga gtc acg gga 
ile gly gin ala asp arg gly ile gin phe asn ala ala gly ala gly arg val thr gly 
421/141 ' 451/151* 

acg gca ggc aag get ggt tac cga cgc egg aca tga att gcg tac ccc get aac gtc get 
thr ala gly lys ala gly tyr arg arg arg thr)OPA(ile ala tyr pro ala asn val ala 
481/161 511/171 ^-SEQ ID NO: 867 

gcg cac caa tgt cga act. ctt gat ggc etc gat ggc ccc ggg ggc tec gcg get ace caa 
ala his gin cys arg thr leu asp gly leu asp gly pro gly gly ser ala ala thr gin 
541/181 571/191 

gca gga gat ggt cga cct gcg tgc cga tgt get ggc tea aat cga gga att gtc cac act 
ala gly asp gly arg pro ala cys arg cys ala gly ser asn arg gly ile val his thr 
601/201 631/211 

ggt agg cga ttt ggt gga cct gtc ccg agg cga cgc egg aga agt ggt gca cga gec ggt 
gly arg arg phe gly gly pro val pro arg arg arg arg arg ser gly ala arg ala gly 
661/221 691/231 

cga cat ggc tga cgt cgt cga ccg cag cct gga gcg ggt cag gcg gcg gcg caa cga tat 
arg his gly)OPA(arg arg arg pro gin pro gly ala gly gin ala ala ala gin arg tyr 
721/241 ^-SEQ ID NO: 868 751/251 

cct ttt cga cgt cga ggt gat tgg gtg gca ggt tta tgg cga tac cgc tgg att gtc gcg 
pro phe arg arg arg gly asp trp val ala gly leu trp arg tyr arg trp ile val ala 
781/261 811/271 

gat ggc get taa cct gat gga caa cgc cgc gaa gtg gag ccc gec ggg egg cca cgt ggg 
asp gly ala)OCH(pro asp gly gin arg arg glu val glu pro ala gly arg pro arg gly 
841/281 ^SEQ ID NO: 869 871/291 

tgt cag get gag cca get cga cgc gtc gca cgc tga get ggt ggt ttc cga ccg egg ccc 

cys gin ala glu pro ala arg arg val ala arg)OPA(ala gly gly phe arg pro arg pro 

901/301 931/311 SEQ ID NO: 870 

ggg cat tec cgt gca gga gcg ccg tct ggt gtt tga acg gtt tta ccg gtc ggc ate ggc 

gly his ser arg ala gly ala pro ser gly val)OPA(thr val leu pro val gly ile gly 

961/321 991/331 ^-SEQ ID NO: 871 

acg ggc gtt gee ggg ttc ggg cct egg gtt ggc gat cgt caa aca ggt ggt get caa cca 

thr gly val ala gly phe gly pro arg val gly asp arg gin thr gly gly ala gin pro 

1021/341 1051/351 

egg egg att get gcg cat cga aga cac cga ccc agg egg cca gee ccc tgg aac gtc gat 
arg arg ile ala ala his arg arg his arg pro arg arg pro ala pro trp asn val asp 
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SEQ ID NOS: 864-877 (continued 1) 

FIG. 49B ( c ont inued 1 ) 




ate tac acc get caa cag ccg ggc cag acg 
( ile t yr thr ala gin gin pro gly gin thr 
61/21"" — SEQ ID NO: 879 

ate cgt ggc gag ttg ttc atg teg egg cgc 
ile arg gly glu leu phe met ser arg arg 
121/41 

cgt ctg acc aac ggt agt teg ctg ctg ate 
arg leu thr asn gly ser ser leu leu ile 
181/61 

atg aac aag ctg cgt tgg gtg eta ttg ate 
met asn lys leu arg trp val leu leu ile 
241/81 

gtg gec ggg ggg atg gtc acc egg gec ggg 
val ala gly gly met val thr arg ala gly 
301/101 

gee gag egg gtg gcg cga acc gac gac ctg 
ala glu arg val ala arg thr asp asp leu 
361/121 

ttg gec agg ctg aca gag gca ttc aat tta 
leu ala arg leu thr glu ala phe asn leu 
421/141 

egg cag gca agg ctg gtt acc gac gee gga 
arg gin ala arg leu val thr asp ala gly 
481/161 

cgc acc aat gtc gaa etc ttg atg gee teg 
arg thr asn val glu leu leu met ala ser 
541/181 

cag gag atg gtc gac ctg cgt gee gat gtg 
gin glu met val asp leu arg ala asp val 
601/201 

gta ggc gat ttg gtg gac ctg tec cga ggc 
val gly asp leu val asp leu ser arg gly 
661/221 

gac atg get gac gtc gtc gac cgc age ctg 
asp met ala asp val val asp arg ser leu 
721/241 

ctt ttc gac gtc gag gtg att ggg tgg cag 
leu phe asp val glu val ile gly trp gin 
781/261 

atg gcg ctt aac ctg atg gac aac gee gcg 
met ala leu asn leu met asp asn ala ala 
841/281 

gtc agg ctg age cag etc gac gcg teg cac 
val arg leu ser gin leu asp ala ser his 
901/301 

ggc att ccc gtg cag gag cgc cgt ctg gtg 
gly ile pro val gin glu arg arg leu val 
961/321 

egg gcg ttg ccg ggt teg ggc etc ggg ttg 
arg ala leu pro gly ser gly leu gly leu 



31/11 

ctg ccg gtc ggt get gee gag aag gcg gtg 
leu pro val gly ala ala glu lys ala val 
91/31 

acc acc gee gac caa egg gtg ctt gee ate 
thr thr ala asp gin arg val leu ala ile 
151/51 

tec aaa agt etc aag ccc acc gaa gca gtc 
ser lys ser leu lys pro thr glu ala val 
211/71 

gtg ggt ggg ate ggg gtg gcg gtc gee gcg 
val gly gly ile gly val ala val ala ala 
271/91 

ctg agg ccg gtg ggc cgc etc acc gaa gcg 
leu arg pro val gly arg leu thr glu ala 
331/111 

egg ccc ate ccc gtc ttc ggc age gac gaa 
arg pro ile pro val phe gly ser asp glu 
391/131 

atg ctg egg gcg ctg gec gag tea egg gaa 
met leu arg ala leu ala glu ser arg glu 
451/151 

cat gaa ttg cgt acc ccg eta acg teg ctg 
his glu leu arg thr pro leu thr ser leu 
511/171 

atg gee ccg ggg get ccg egg eta ccc aag 
met ala pro gly ala pro arg leu pro lys 
571/191 

ctg get caa ate gag gaa ttg tec aca ctg 
leu ala gin ile glu glu leu ser thr leu 
631/211 

gac gee gga gaa gtg gtg cac gag ccg gtc 
asp ala gly glu val val his glu pro val 
691/231 

gag egg gtc agg egg egg cgc aac gat ate 
glu arg val arg arg arg arg asn asp ile 
751/251 

gtt tat ggc gat acc get gga ttg teg egg 
val tyr gly asp thr ala gly leu ser arg 
811/271 

aag tgg age ccg ccg ggc ggc cac gtg ggt 
lys trp ser pro pro gly gly his val gly 
871/291 

get gag ctg gtg gtt tec gac cgc ggc ccg 
ala glu leu val val ser asp arg gly pro 
931/311 

ttt gaa egg ttt tac egg teg gca teg gca 
phe glu arg phe tyr arg ser ala ser ala 
991/331 

gcg ate gtc aaa cag gtg gtg etc aac cac 
ala ile val lys gin val val leu asn his 



SEQ ID NOSs 878-882 

FIGo 49C 



1021/341 1051/351 

ggc gga ttg ctg cgc ate gaa gac acc gac cca ggc ggc cag ccc cct gga acg teg att 
gly gly leu leu arg ile glu asp thr asp pro gly gly gin pro pro gly thr ser ile 
1081/361 1111/371 

tac gtg ctg etc ccc ggc cgt egg atg ccg att ccg cag ctt ccc ggt gcg acg get ggc 
tyr val leu leu pro gly arg arg met pro ile pro gin leu pro gly ala thr ala gly 
1141/381 1171/391 

get egg age acg gac ate gag aac tct egg ggt teg gcg aac gtt ate tea gtg gaa tct 
ala arg ser thr asp ile glu asn ser arg gly ser ala asn val ile ser val glu ser 
1201/401 1231/411 

cag tec acg cgc gca acc tag ttg tgc agt tac tgt tga aag cca cac cca tgc cag tec 

gin ser thr arg ala thr)AMB(leu cys ser tyr cys)OPA(lys pro his pro cys gin ser 

1261/421 SEQ ID NO: 880"^ 1291/431 ^SEQ ID NO: 881 

acg cat ggc caa gtt ggc ccg agt agt ggg cct agt aca gga aga gca acc tag cga cat 

thr his gly gin val gly pro ser ser gly pro ser thr gly arg ala thr)AMB(arg his 

1321/441 1351/451 SEQ ID NO: 882-^ 

gac gaa tea ccc acg gta ttc gee acc gee gca gca gee ggg aac ccc agg tta tgc tea 

asp glu ser pro thr val phe ala thr ala ala ala ala gly asn pro arg leu cys ser 

1381/461 1411/471 

999 gca gca gca aac gta cag cca gca gtt cga ctg gcg tta ccc acc gtc ccc gee ccc 
gly ala ala ala asn val gin pro ala val arg leu ala leu pro thr val pro ala pro 
1441/481 1471/491 

gca gee aac cca gta ccg tea acc eta cga ggc gtt ggg tgg tac ccg gee ggg tct gat 
ala ala asn pro val pro ser thr leu arg gly val gly trp tyr pro ala gly ser asp 
1501/501 1531/511 

acc tgg cgt gat tec gac cat gac gee ccc tec tgg gat ggt teg cca acg ccc teg tgc 
thr trp arg asp ser asp his asp ala pro ser trp asp gly ser pro thr pro ser cys 
1561/521 1591/531 

agg cat gtt ggc cat egg cgc ggt gac gat age ggt ggt gtc cgc egg cat egg egg cgc 
arg his val gly his arg arg gly asp asp ser gly gly val arg arg his arg arg arg 
1621/541 1651/551 

ggc cgc ate cct ggt egg gtt caa ccg ggc acc cgc egg ccc cag egg egg ccc agt ggc 
gly arg ile pro gly arg val gin pro gly thr arg arg pro gin arg arg pro ser gly 
1681/561 1711/571 

tgc cag cgc ggc gee aag cat ccc cgc age aaa cat gee gee ggg gtc ggt cga aca ggt 
cys gin arg gly ala lys his pro arg ser lys his ala ala gly val gly arg thr gly 
1741/581 1771/591 

ggc ggc caa ggt ggt gee cag tgt cgt cat gtt gga aac cga tct ggg ccg cca gtc gga 
gly gly gin gly gly ala gin cys arg his val gly asn arg ser gly pro pro val gly 
1801/601 1831/611 

gga ggg etc egg cat cat tct gtc tgc cga ggg get gat ctt gac caa caa cca cgt gat 
gly gly leu arg his his ser val cys. arg gly ala asp leu asp gin gin pro arg asp 
1861/621 1891/631 

cgc ggc ggc cgc caa gee tec cct ggg cag tec gee gee gaa aac gac ggt a 
arg gly gly arg gin ala ser pro gly gin ser ala ala glu asn asp gly) 
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ccc 


age 


aac 
yy^ 


crcrc 


cca 


ata 

y v -y 


get 


gcc 


ala ser 


leu 


val 


y -Ly 


phe 


asn 


arg 


ala 


pro 


ala gly 


pro 


ser 


crlv 
y j 


crlv 


pro 


val 


ala 


ala 


42 1/141 


















451/151 


















acre a co 


ci act 
y *—y 


cca 


acre 

<3.y \— 


ate 


ccc 


gca 


gca 


aac 


ata cccr 


ccg 


crcrcr 
yyy 


teg 


crtc 


gaa 


cag 


ata 
y *— y 


aca 

y y 


ser ala 


ala 


pro 


ser 


ile 


pro 


ala 


ala 


asn 


met pro 


pro 


crlv 


ser 


val 


crlu 


gin 


val 


ala 


481/161 


















511/171 


















gcc aag 


crtcr 


crtcr 
y *-y 


ccc 


agt 


gtc 


gtc 


atg 


tta 


gaa acc 


gat 


ctg 


crcrc 


cgc 


cag 


tea 


gag 


gag 


ala lys 


val 


val 


pro 


ser 


val 


val 


met 


leu 


glu thr 


asp 


leu 


gly 


arg 


gin 


ser 


glu 


glu 


541/181 


















571/191 


















ggc tec 


aac 


ate 


att 


ctg 


tct 


gcc 


cracr 


Gcrcr 

3 3 3 


ctg ate 


ttg 


acc 


aac 


aac 


cac 


gtg 


ate 


gcg 


gly ser 


gly 


ile 


ile 


leu 


ser 


ala 


glu 


gly 


leu ile 


leu 


thr 


asn 


asn 


his 


. val 


ile 


ala 


601/201 


















631/211 


















gcg gcc 


gcc 


aag 


cct 


ccc 


ctg 


ggc 


agt 


ccg 


ccg ccg 


aaa 


acg 


acg 


gta 


acc 


ttc 


tct 


gac 


ala ala 


ala 


lys 


pro 


pro 


leu 


gly 


ser 


pro 


pro pro 


lys 


thr 


thr 


val 


thr 


phe 


ser 


asp 


661/221 


















691/231 


















ggg egg 


acc 


gca 


ccc 


ttc 


acg 


gtg 


gtg 


ggg 


get gac 


ccc 


acc 


agt 


gat 


ate 


gcc 


gtc 


gtc 


gly arg 


thr 


ala 


pro 


phe 


thr 


val 


val 


gly 


ala asp 


pro 


thr 


ser 


asp 


ile 


ala 


val 


val 


721/241 


















751/251 


















cgt gtt 


cag 


ggc 


gtc 


tec 


ggg 


etc 


acc 


ccg 


ate tec 


ctg 


ggt 


tec 


tec 


teg 


gac 


ctg 


agg 


arg val 


gin 


gly 


val 


ser 


gly 


leu 


thr 


pro 


ile ser 


leu 


gly 


ser 


ser 


ser 


asp 


leu 


arg 


781/261 


















811/271 


















gtc ggt 


cag 


ccg 


gtg 


ctg 


gcg 


ate 


ggg 


teg 


ccg etc 


ggt 


ttg 


gag 


ggc 


acc 


gtg 


acc 


acg 


val gly 


gin 


pro 


val 


leu 


ala 


ile 


gly 


ser 


pro leu 


gly 


leu 


glu 


gly 


thr 


val 


thr 


thr 



SEQ ID ISTOSs 883-884 



FIG. 49D 




841/281 

ggg ate gtc age get etc aac cgt cca gtg 
gly ile .val ser ala leu asn arg pro val 
901/301 

ace gtg ctg gac gec att cag ace gac gee 
thr val leu asp ala ile gin thr asp ala 
961/321 

ctg gtg aac atg aac get caa etc gtc gga 
leu val asn met asn ala gin leu val gly 
1021/341 

gac tea gee gat gcg cag age ggc teg ate 
asp ser ala asp ala gin ser gly ser ile 
1081/361 

gee aag cgc ate gee gac gag ttg ate age 
ala lys arg ile ala asp glu leu ile ser 
1141/381 

gtg cag gtg ace aat gac aaa gac ace ctg 
val gin val thr asn asp lys asp thr leu 
1201/401 

ggt get gee gcg aac get gga gtg ccg aag 
gly ala ala ala asn ala gly val pro lys 
1261/421 

ccg ate aac age gcg gac gcg ttg gtt gec 
pro ile asn ser ala asp ala leu val ala 
1321/441 

gtg gcg eta ace ttt cag gat ccc teg ggc 
val ala leu thr phe gin asp pro ser gly 
1381/461 

aag gcg gag cag tga 
lys ala glu gln)OPA 



871/291 

teg acg ace ggc gag gee ggc aac cag aac 

ser thr thr gly glu ala gly asn gin asn 
931/311 

gcg ate aac ccc ggt aac tec ggg ggc gcg 

ala ile asn pro gly asn ser gly gly ala 
991/331 

gtc aac teg gec att gec acg ctg ggc gcg 

val asn ser ala ile ala thr leu gly ala 
1051/351 

ggt etc ggt ttt gcg att cca gtc gac cag 

gly leu gly phe ala ile pro val asp gin 
1111/371 

acc ggc aag gcg tea cat gee tec ctg ggt 

thr gly lys ala ser his ala ser leu gly 
1171/391 

ggc gee aag ate gtc gaa gta gtg gee ggt 

gly ala lys ile val glu val val ala gly 
1231/411 

ggc gtc gtt gtc acc aag gtc gac gac cgc 

gly val val val thr lys val asp asp arg 
1291/431 

gee gtg egg tec aaa gcg ccg ggc gee acg 

ala val arg ser lys ala pro gly ala thr 
1351/451 

ggt age cgc aca gtg caa gtc acc etc ggc 

gly ser arg thr val gin val thr leu gly 



SEQ ID NOSs 883-884 (continued 1) 
FSGo 49 D (continued 1) 



2 6 zm § 

ORF according to Cole et al . , 1998 {Nature 393:537-544) and containing Rv0983 
1A1_SEQ ID NO: 885 31/11 

tga gcc age teg acg cgt cgc acg ctg age tgg tgg ttt ccg acc gcg gee egg gca ttc 
OPA(ala ser ser thr arg arg thr leu ser trp trp phe pro thr ala ala arg ala phe 
61/21^-SEQ ID NO: 886 91/31 

ccg tgc agg age gcc gtc tgg tgt ttg aac ggt ttt acc ggt egg cat egg cac ggg cgt 
pro cys arg ser ala val trp cys leu asn gly phe thr gly arg his arg his gly arg 
121/41 151/51 

tgc egg gtt egg gcc teg ggt tgg cga teg tea aac agg tgg tgc tea acc acg gcg gat 
cys arg val arg ala ser gly trp arg ser ser asn arg trp cys ser thr thr ala asp 
181/61 211/71 

tgc tgc gca teg aag aca ccg acc cag gcg gcc age ccc ctg gaa cgt cga ttt acg tgc 
cys cys ala ser lys thr pro thr gin ala ala ser pro leu glu arg arg phe thr cys 
241/81 271/91 

tgc tec ccg gcc gtc gga tgc cga ttc cgc age ttc ccg gtg cga egg ctg gcg etc gga 
cys ser pro ala val gly cys arg phe arg ser phe pro val arg arg leu ala leu gly 
301/101 331/111 

gca egg aca teg aga act etc ggg gtt egg cga acg tta tct cag tgg aat etc agt cca 
ala arg thr ser arg thr leu gly val arg arg thr leu ser gin trp asn leu ser pro 
361/121 391/131 

cgc gcg caa cct agt tgt gca gtt act gtt gaa age cac acc cat gcc agt cca cgc atg 
arg ala gin pro ser cys ala val thr val glu ser his thr his ala ser pro arg met 
421/141 451/151 

gcc aag ttg gcc cga gta gtg ggc eta gta cag gaa gag caa cct age gac atg acg aat 
ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp met thr asn 
481/161 511/171 

cac cca egg tat teg cca ccg ccg cag cag ccg gga acc cca ggt tat get cag ggg cag 
his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala gin gly gin 
541/181 571/191 

cag caa acg tac age cag cag ttc gac tgg cgt tac cca ccg tec ccg ccc ccg cag cca 
gin gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro pro gin pro 
601/201 631/211 

acc cag tac cgt caa ccc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg ata cct ggc 
thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu ile pro gly 
661/221 691/231 

gtg att ccg acc atg acg ccc cct cct ggg atg gtt cgc caa cgc cct cgt gca ggc atg 
val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg ala gly met 
721/241 751/251 

ttg gcc ate ggc gcg gtg acg ata gcg gtg gtg tec gcc ggc ate ggc ggc gcg gcc gca 
leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly ala ala ala 
781/261 811/271 

tec ctg gtc ggg ttc aac egg gca ccc gcc ggc ccc age ggc ggc cca gtg get gcc age 
ser leu val gly phe asn arg ala pro ala gly pro ser gly gly pro val ala ala ser 
841/281 871/291 

gcg gcg cca age ate ccc gca gca aac atg ccg ccg ggg teg gtc gaa cag gtg gcg gcc 
ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin val ala ala 



SEQ ID NOSs 885-886 




901/301 

aag gtg gtg ccc agt gtc gtc atg ttg gaa 
lys val val pro ser val val met leu glu 
961/321 

tec ggc ate att ctg tct gec gag ggg ctg 
ser gly ile ile leu ser ala glu gly leu 
1021/341 

gee gec aag cct ccc ctg ggc agt ccg ccg 
ala ala lys pro pro leu gly ser pro pro 
1081/361 

egg acc gca ccc ttc acg gtg gtg ggg get 
arg thr ala pro phe thr val val gly ala 
1141/381 

gtt cag ggc gtc tec ggg etc acc ccg ate 
val gin gly val ser gly leu thr pro ile 
1201/401 

ggt cag ccg gtg ctg gcg ate ggg teg ccg 
gly gin pro val leu ala ile gly ser pro 
1261/421 

ate gtc age get etc aac cgt cca gtg teg 
ile val ser ala leu asn arg pro val ser 
1321/441 

gtg ctg gac gee att cag acc gac gee gcg 
val leu asp ala ile gin thr asp ala ala 
1381/461 

gtg aac atg aac get caa etc gtc gga gtc 
val asn met asn ala gin leu val gly val 
1441/481 

tea gee gat gcg cag age ggc teg ate ggt 
ser ala asp ala gin ser gly ser ile gly 
1501/501 

aag cgc ate gee gac gag ttg ate age acc 
lys arg ile ala asp glu leu ile ser thr 
1561/521 

cag gtg acc aat gac aaa gac acc ctg ggc 
gin val thr asn asp lys asp thr leu gly 
1621/541 

get gec gcg aac get gga gtg ccg aag ggc 
ala ala ala asn ala gly val pro lys gly 
1681/561 

ate aac age gcg gac gcg ttg gtt gee gee 
ile asn ser ala asp ala leu val ala ala 
1741/581 

gcg eta acc ttt cag gat ccc teg ggc ggt 

ala leu thr phe gin asp pro ser gly gly 

1801/601 

gcg gag cag tga 

ala glu gln)0PA 



931/311 

acc gat ctg ggc cgc cag teg gag gag ggc 
thr asp leu gly arg gin ser glu glu gly 
991/331 

ate ttg acc aac aac cac gtg ate gcg gcg 
ile leu thr asn asn his val ile ala ala 
1051/351 

ccg aaa acg acg gta acc ttc tct gac ggg 
pro lys thr thr val thr phe ser asp gly 
1111/371 

gac ccc acc agt gat ate gec gtc gtc cgt 
asp pro thr ser asp ile ala val val arg 
1171/391 

tec ctg ggt tec tec teg gac ctg agg gtc 
ser leu gly ser ser ser asp leu arg val 
1231/411 

etc ggt ttg gag ggc acc gtg acc acg ggg 
leu gly leu glu gly thr val thr thr gly 
1291/431 

acg acc ggc gag gee ggc aac cag aac acc 
thr thr gly glu ala gly asn gin asn thr 
1351/451 

ate aac ccc ggt aac tec ggg ggc gcg ctg 
ile asn pro gly asn ser gly gly ala leu 
1411/471 

aac teg gee att gee acg ctg ggc gcg gac 
asn ser ala ile ala thr leu gly ala asp 
1471/491 

etc ggt ttt gcg att cca gtc gac cag gee 
leu gly phe ala ile pro val asp gin ala 
1531/511 

ggc aag gcg tea cat gee tec ctg ggt gtg 
gly lys ala ser his ala ser leu gly val 
1591/531 

gee aag ate gtc gaa gta gtg gec ggt ggt 
ala lys ile val glu val val ala gly gly 
1651/551 

gtc gtt gtc acc aag gtc gac gac cgc ccg 
val val val thr lys val asp asp arg pro 
1711/571 

gtg egg tec aaa gcg ccg ggc gee acg gtg 
val arg ser lys ala pro gly ala thr val 
1771/591 

age cgc aca gtg caa gtc acc etc ggc aag 
ser arg thr val gin val thr leu gly lys 



SEQ ID NOSs 885-886 (continued 1) 

FIG. 49E (continued 1) 




Fragment amplified by PCR based on the sequence similarities with a serine 
protease of the E.coli HtrA family (creation of an SnaBI site at the 
3' end) and subcloned into the vector pJVEDa: 



1/l^SEQ ID NO: 887 31/11 



gat ccg 


gcg 


ggg 


egg 


gtg 


teg 


gcg 


cag 


gcg 


tgg ctg 


gcg 


gtc 


acg 


gcg 


gtg 


egg 


gcg 


gtg 


(asp pro 


a ia 


gly 


arg 


val 


ser 


aia 


gin 


ala 


trp leu 


ala 


val 


thr 


ala 


val 


arg 


ala 


val 


61/2 1 - 


* SEQ 


ID NO: 888 










91/31 


















ccg ccg 


ggc 


tgt 


ggg 


gcg 


ccg 


gcg 


gcg 


gcg 


gtg gca 


atg 


gcg 


gga 


acg 


gcg 


ccg 


atg 


cca 


pro pro 


gly 


cys 


gly 


ala 


pro 


ala 


ala 


ala 


val ala 


met 


ala 


gly 


thr 


ala 


pro 


met 


pro 


121/41 


















151/51 


















aca teg 


tea 


gcg 


gtg 


gag 


acg 


gtg 


gec 


teg 


gcg gtg 


ccg 


gtg 


gcg 


gtg 


gcg 


gat 


ggc 


tct 


thr ser 


ser 


ala 


val 


glu 


thr 


val 


ala 


ser 


ala val 


pro 


val 


ala 


val 


ala 


asp 


gly 


ser 


181/61 


















211/71 


















acg gcg 


acg 


gcg 


ggg 


ccg 


gcg 


gac 


acg 


gcg 


gac aag 


gcg 


caa 


teg 


gec 


teg 


gcg 


gcg 


gcg 


thr ala 


thr 


ala 


gly 


pro 


ala 


asp 


thr 


ala 


asp lys 


ala 


gin 


ser 


ala 


ser 


ala 


ala 


ala 


241/81 


















271/91 


















ccg gcg 


gcg 


acg 


ggg 


gec 


agg 


gcg 


gcg 


ccg 


gec gcg 


gac 


tgt 


ggg 


gta 


ctg 


gcg 


gcg 


ccg 


pro ala 


ala 


thr 


gly 


ala 


arg 


ala 


ala 


pro 


ala ala 


asp 


cys 


gly 


val 


leu 


ala 


ala 


pro 


301/101 


















331/111 


















gcg gac 


acg 


gcg 


ggc 


aag 


gcg 


gtg 


gta 


ccg 


ggg gec 


cac 


cgc 


tgc 


ccg 


gtc 


agg 


cag 


gca 


ala asp 


thr 


ala 


gly 


lys 


ala 


val 


val 


pro 


gly ala 


his 


arg 


cys 


pro 


val 


arg 


gin 


ala 


361/12 1 


















391/131 


















tgg gcg 


ccg 


egg 


gtg 


gcg 


ccg 


gtg 


ggc 


tga 


teg gca 


acg 


gcg 


ggg 


ccg 


gcg 


gcg 


acg 


gcg 


trp ala 


pro 


arg 


val 


ala 


pro 


val 


gly)OPA(ser ala 


thr 


ala 


gly 


pro 


ala 


ala 


thr 


ala 


421/141 






SEQ ID NO: 


: 889 




451/151 


















gtg teg 


gcg 


cgt 


ccg 


gcg 


ggg 


teg 


ccg 


gag 


tag gcg 


gtg 


ccg 


gcg 


gga 


acg 


cca 


tgc 


tga 


val ser 


ala 


arg 


pro 


ala 


gly 


ser 


pro 


glu) AMB (ala 


val 


pro 


ala 


gly 


thr 


pro 


cys ) OPA 


481/161 


















511/171 


""~-SEQ ID NO: 


: 890 








teg ggc 


acg 


gcg 


gcg 


ccg 


gcg 


gcg 


ccg 


gcg 


gag aca 


gca 


gtt 


teg 


eta 


atg 


gcg 


egg 


ccg 


(ser gly 


thr 


ala 


ala 


pro 


ala 


ala 


pro 


ala 


glu thr 


ala 


val 


ser 


leu 


met 


ala 


arg 


pro 


541/181" 


* SEQ ID NO: 


: 891 








571/191 


















gcg gcg 


egg 


gcg 


gtg 


ccg 


gag 


ggc 


acc 


tct 


teg gca 


atg 


gcg 


ggt 


ccg 


gcg 


gec 


acg 


gcg 


ala ala 


arg 


ala 


val 


pro 


glu 


gly 


thr 


ser 


ser ala 


met 


ala 


gly 


pro 


ala 


ala 


thr 


ala 


601/201 


















631/211 


















gag ccg 


tea 


egg 


ccg 


gca 


aca 


ccg 


gta 


teg 


gtg gcg 


ccg 


gcg 


gcg 


teg 


gtg 


ggg 


acg 


cca 


glu pro 


ser 


arg 


pro 


ala 


thr 


pro 


val 


ser 


val ala 


pro 


ala 


ala 


ser 


val 


gly 


thr 


pro 


661/221 


















691/231 


















ggc tga 


teg 


gec 


acg 


gtg 


gcg 


ccg 


gcg 


gtg 


ccg gcg 


ggg 


acc 


gcg 


ccg 


gag 


cct 


tgg 


ttg 


gly)OPA(ser 


ala 


thr 


val 


ala 


pro 


ala 


val 


pro ala 


gly 


thr 


ala 


pro 


glu 


pro 


trp 


leu 


721/241 


^SEQ ID NO: 


: 892 








751/251 


















gec gtg 


acg 


gcg 


ggc 


ccg 


gtg 


gga 


acg 


ggg 


gcg ctg 


gcg 


gee 


age 


tat 


acg 


gca 


acg 


gcg 


ala val 


thr 


ala 


gly 


pro 


val 


gly 


thr 


gly 


ala leu 


ala 


ala 


ser 


tyr 


thr 


ala 


thr 


ala 


781/261 


















811/271 


















gcg acg 


gcg 


ccc 


ccg 


gca 


ccg 


gcg 


gaa 


cac 


tgc agg 


egg 


egg 


tga 


gcg 


gat 


tgg 


tga 


egg 


ala thr 


ala 


pro 


pro 


ala 


pro 


ala 


glu 


his 


cys arg 


arg 


arg)OPA 


ala 


asp 


trp 


OPA (arg 


841/281 


















871/291 








SEQ ID NO: 


; 893^ 


ctt tgt 


teg 


gtg 


cac 


ccg 


gec 


aac 


ccg 


gcg 


aca ccg 


gec 


aac 


ccg 


get 


age 


ccc 


gat 


caa 


leu cys 


ser 


val 


his 


pro 


ala 


asn 


pro 


ala 


thr pro 


ala 


asn 


pro 


ala 


ser 


pro 


asp 


gin 


901/301 


















931/311 


















cga ggg 


ttt 


egg 


tgc 


egg 


tec 


ggg 


gca 


tgg 


cca tec 


get 


gag 


ctg 


gcg 


ate 


tgg 


act 


acg 


arg gly 


phe 


arg 


cys 


arg 


ser 


gly 


ala 


trp 


pro ser 


ala 


glu 


leu 


ala 


ile 


trp 


thr 


thr 


961/321 


















991/331 


















ttg gtg 


tag 


aaa 


aat 


cct 


gec 


gee 


egg 


acc 


ctt aag 


get 


ggg 


aca 


att 


tct 


gat 


age 


tac 


leu val)AMB(lys 


asn 


pro 


ala 


ala 


arg 


thr 


leu lys 


ala 


gly 


thr 


ile 


ser 


asp 


ser 


tyr 


1021/341 


* — SEQ ID NO: 


894 






1051/351 
















ccc gac 


aca 


gga 


ggt 


tac 


ggg 


atg 


age 


aat 


teg cgc 


cgc 


cgc 


tea 


etc 


agg 


tgg 


tea 


tgg 


pro asp 


thr 


gly 


gly 


tyr 


gly 


met 


ser 


asn 


ser arg 


arg 


arg 


ser 


leu 


arg 


trp 


ser 


trp 


1081/361 
















1111/371 
















ttg ctg 


age 


gtg 


ctg 


get 


gee 


gtc 


ggg 


ctg 


ggc ctg 


gec 


acg 


gcg 


ccg 


gee 


cag 


gcg 


gee 


leu leu" ser 


val 


leu 


ala 


ala 


val 


gly 


leu 


gly leu 


ala 


thr 


ala 


pro 


ala 


gin 


ala 


ala 



1141/381 

ccg ccg gee ttg teg cag gac egg tt 
pro pro ala leu ser 'gin asp arg) 

SEQ ID NOS:887-894 



FIG. 50A 



1/1 SEQ ID NO: 


895 










31/11 


















ate egg 


egg 


ggc 


ggg 


tgt 


egg 


cgc 


agg 


cgt 


ggc tgg 


egg 


tea 


egg 


egg 


tgc 


ggg 


egg 


tgc 


(ile arg 


arg 


gly 


gly 


cys 


arg 


arg 


arg 


arg 


gly trp 


arg 


ser 


arg 


arg 


cys 


gly 


arg 


cys 


61/21^SEQ ID NO: 896 










91/31 


















cgc egg 


get 


gtg 


ggg 


cgc 


egg 


egg 


egg 


egg 


tgg caa 


tgg 


egg 


gaa 


egg 


cgc 


cga 


tgc 


caa 


arg arg 


ala 


val 


gly 


arg 


arg 


arg 


arg 


arg 


trp gin 


trp 


arg 


glu 


arg 


arg 


arg 


cys 


gin 


121/41 


















151/51 


















cat cgt 


cag 


egg 


tgg 


aga 


egg 


tgg 


cct 


egg 


egg tgc 


egg 


tgg 


egg 


tgg 


egg 


atg 


get 


eta 


his arg 


gin 


arg 


trp 


arg 


arg 


trp 


pro 


arg 


arg cys 


arg 


trp 


arg 


trp 


arg 


met 


ala 


leu 


181/61 


















211/71 


















egg cga 


egg 


egg 


ggc 


egg 


egg 


aca 


egg 


egg 


aca agg 


cgc 


aat 


egg 


cct 


egg 


egg 


egg 


cgc 


arg arg 


arg 


arg 


gly 


arg 


arg 


thr 


arg 


arg 


thr arg 


arg 


asn 


arg 


pro 


arg 


arg 


arg 


arg 


241/81 


















271/91 


















ccrcr ccrcr 


cga 


cacr 


crcrcr 

3 3 3 


cca 


ggg 


egg 


cgc 


egg 


ccg egg 


act 


gtg 


ggg 


tac 


tgg 


egg 


cgc 


egg 


arg arg 


arg 


arg 


gly 


pro 


gly 


arg 


arg 


arg 


pro arg 


thr 


val 


gly 


tyr 


trp 


arg 


arg 


arg 


301/101 


















331/111 


















egg aca 


egg 


ccrcr 

'-'3 3 


gca 


acrcr 


egg 


tgg 


tac 


egg 


ggg ccc 


acc 


get 


gec 


egg 


tea 


ggc 


agg 


cat 


arg thr 


arg 


arg 


ala 


arcr 


arg 


trp 


tyr 


arg 


gly pro 


thr 


ala 


ala 


arg 


ser 


gly 


arg 


his 


361/121 


















391/131 


















crcrcr cere 

3 3 3 V-y W 


cgc 


crcrcr 

3 3 3 


tcrcr 


cgc 


egg 


tgg 


get 


gat 


egg caa 


egg 


egg 


ggc 


egg 


egg 


cga 


egg 


egg 


gly arg 


arg 


gly 




arcr 


arg 


trp 


ala 


asp 


arg gin 


arg 


arg 


gly 


arg 


arg 


arg 


arg 


arg 


421/141 


















451/151 


















tgt egg 


cgc 


gtc 


ccrcr 

3 3 


cacr 

^ 3 3 


ggt 


cgc 


egg 


agt 


agg egg 


tgc 


egg 


egg 


gaa 


cgc 


cat 


get 


gat 


cys arg 


arg 


val 


arcr 


arcr 


gly 


arg 


arg 


ser 


arg arg 


cys 


arg 


arg 


glu 


arg 


his 


ala 


asp 


481/161 


















511/171 


















egg gca 


egg 


ccrcr 

33 


cgc 


ccrcr 

3 3 


egg 


cgc 


egg 


egg 


aga cag 


cag 


ttt 


cgc 


taa 


tgg 


cgc 


ggc 


egg 


arg ala 


arg 


arg 


arg 


arg 


arg 


arg 


arg 


arg 


arg gin 


gin 


phe 


arg)OCH(trp 


arg 


gly 


arg 


541/181 


















571/191 


SEQ ID 


NO: 


897' 










egg cgc 


ggg 


egg 


tgc 


egg 


agg 


gca 


cct 


ctt 


egg caa 


tgg 


egg 


gtc 


egg 


egg 


cca 


egg 


egg 


arg arg 


gly 


arg 


cys 


arg 


arg 


ala 


pro 


leu 


arg gin 


trp 


arg 


val 


arg 


arg 


pro 


arg 


arg 


601/201 


















631/211 


















age cgt 


cac 


ggc 


egg 


caa 


cac 


egg 


tat 


egg 


tgg cgc 


egg 


egg 


cgt 


egg 


tgg 


gga 


cgc 


cag 


ser arg 


his 


gly 


arg 


gin 


his 


arg 


tyr 


arg 


trp arg 


arg 


arg 


arg 


arg 


trp 


gly 


arg 


gin 


661/221 


















691/231 


















get gat 


egg 


cca 


egg 


tgg 


cgc 


egg 


egg 


tgc 


egg egg 


gga 


ccg 


cgc 


egg 


age 


ctt 


ggt 


tgg 


ala asp 


arg 


pro 


arg 


trp 


arg 


arg 


arg 


cys 


arg arg 


gly 


pro 


arg 


arg 


ser 


leu 


gly 


trp 


721/241 


















751/251 


















ccg tga 


egg 


egg 


gee 


egg 


tgg 


gaa 


egg 


ggg 


cgc tgg 


egg 


cca 


get 


ata 


egg 


caa 


egg 


egg 


pro)OPA(arg 


arg 


ala 


arg 


trp 


glu 


arg 


gly 


arg trp 


arg 


pro 


ala 


ile 


arg 


gin 


arg 


arg 


781/261 




SEQ 


ID NO: 898 








811/271 


















cga egg 


cgc 


ccc 


egg 


cac 


egg 


egg 


aac 


act 


gca ggc 


ggc 


ggt 


gag 


egg 


att 


ggt 


gac 


ggc 


arg arg 


arg 


pro 


arg 


his 


arg 


arg 


asn 


thr 


ala gly 


gly 


gly 


glu 


arg 


ile 


gly 


asp 


gly 


841/281 


















871/291 


















ttt gtt 


egg 


tgc 


ace 


egg 


cca 


acc 


egg 


cga 


cac egg 


cca 


acc 


egg 


eta 


gee 


ccg 


ate 


aac 


phe val 


arg 


cys 


thr 


arg 


pro 


thr 


arg 


arg 


his arg 


pro 


thr 


arg 


leu 


ala 


pro 


ile 


asn 


901/301 


















931/311 


















gag ggt 


ttc 


ggt 


gee 


ggt 


ccg 


ggg 


cat 


ggc 


cat ccg 


ctg 


age 


tgg 


cga 


tct 


gga 


eta 


cgt 


glu gly 


phe 


gly 


ala 


gly 


pro 


gly 


his 


gly 


his pro 


leu 


ser 


trp 


arg 


ser 


gly 


leu 


arg 


961/321 


















991/331 


















tgg tgt 


aga 


aaa 


ate 


ctg 


ccg 


ccc 


gga 


ccc 


tta agg 


ctg 


gga 


caa 


ttt 


ctg 


ata 


get 


acc 


trp cys 


arg 


lys 


ile 


leu 


pro 


pro 


gly 


pro 


leu arg 


leu 


gly 


gin 


phe 


leu 


ile 


ala 


thr 


1021/341 
















1051/351 
















ccg aca 


cag 


gag 


gtt 


acg 


gga 


tga 


gca 


att 


cgc gee 


gee 


get 


cac 


tea 


ggt 


ggt 


cat 


ggt 


pro thr 


gin 


glu 


val 


thr 


gly)OPA(ala 


ile 


arg ala 


ala 


ala 


his 


ser 


gly 


gly 


his 


gly 


1081/361 


SEQ ID 


NO: 


899 








1111/371 
















tgc tga 


gcg 


tgc 


tgg 


ctg 


ccg 


teg 


ggc 


tgg 


gee tgg 


cca 


egg 


cgc 


egg 


ccc 


agg 


egg 


ccc 


cys)OPA(ala 


cys 


trp 


leu 


pro 


ser 


gly 


trp 


ala trp 


pro 


arg 


arg 


arg 


pro 


arg 


arg 


pro 


1141/381^- SEQ ID NO 


: 900 


























cgc egg 


cct 


tgt 


cgc 


agg 


ace 


ggt 


t 






















arg arg 


pro 


cys 


arg 


arg 


thr 


gly) 























SEQ ID NOSs 895-900 

FIG.50B 




1/1 SEQ ID NO: 


901 










31/11 


















tec ggc 
(ser gly 
61/21^-— 


ggg 
gly 

SEQ 


gcg ggt gtc 
ala gly val 
ID NO: 902 


ggc 
gly 


gca 
ala 


ggc 
gly 


gtg 
val 


get ggc 
ala gly 
91/31 


ggt 
gly 


cac 
his 


ggc 

gly 


ggt 
gly 


gcg 

ala 


ggc 
gly 


ggt 
gly 


gee 
ala 


gec ggg 
ala gly 
121/41 


ctg 
leu 


tgg 
tro 


ggc 
alv 


gec 
ala 


ggc 
gly 


ggc 

gly 


ggc 
gly 


ggt 
gly 


ggc aat 
gly asn 
151/51 


ggc 
gly 


ggg 
gly 


aac 
asn 


ggc 
gly 


gec 
ala 


gat 
asp 


gec 
ala 


aac 
asn 


ate gtc 
ile val 
181/61 


age 
ser 


ggt 

crl \/ 
y -i-y 


gga 

crl v 
y .Y 


gac 
asp 


ggt 
gly 


ggc 
gly 


etc 
leu 


ggc 
gly 


ggt . gec 
gly ala 
211/71 


ggt 
gly 


ggc 
gly 


ggt 
gly 


ggc 
gly 


gga 
gly 


tgg 
trp 


etc 
leu 


tac 
tyr 


ggc gac 
gly asp 
241/81 


ggc 

olv 


ggg 

crl \/ 
y J-Jr 


gec 
ala 


ggc 


gga 
gly 


cac 
his 


ggc 
gly 


gga 
gly 


caa ggc 
gin gly 
271/91 


gca 
ala 


ate 
ile 


ggc 
gly 


etc 
leu 


ggc 
gly 


ggc 
gly 


ggc 
gly 


gec 
ala 


ggc ggc 
crlv crlv 
301/101 


gac 
asp 


ggg 

y J-y 


ggc 

crl \/ 


cag 
gin 


ggc 
gly 


ggc 
gly 


gee 
ala 


ggc 
gly 


cgc gga 
arg gly 
331/111 


ctg 
leu 


tgg 
trp 


ggt 
gly 


act 
thr 


ggc 
gly 


ggc 
gly 


gec 
ala 


ggc 
gly 


gga cac 
gly his 
361/121 


ggc 

crlv 


ggg 
gly 


caa 


ggc 
gly 


ggt 
gly 


ggt 
gly 


acc 
thr 


ggg 
gly 


ggc cca 
gly pro 
391/131 


ccg 
pro 


ctg 
leu 


ccc 
pro 


ggt 
gly 


cag 
gin 


gca 
ala 


ggc 

gly 


atg 
met 


ggc gec 
gly ala 
421/141 


gcg 
ala 


ggt 
gly 


ggc 

gly 


gee 
ala 


ggt 
gly 


ggg 

gly 


ctg 
leu 


ate 
ile 


ggc aac 
gly asn 
451/151 


ggc 

gly 


ggg 
gly 


gee 
ala 


ggc 
gly 


ggc 
gly 


gac 
asp 


ggc 
gly 


ggt 
gly 


gtc ggc 
val gly 
481/161 


gcg 
ala 


tec 
ser 


ggc 

crl v 
y -i-y 


ggg 

rrl \/ 
y -"-y 


gtc 
val 


gee 
ala 


gga 

gly 


gta 
val 


ggc ggt 
gly gly 
511/171 


gec 
ala 


ggc 
gly 


ggg 
gly 


aac 
asn 


gec 
ala 


atg 
met 


ctg 
leu 


ate 
ile 


ggg cac 
gly his 
541/181 


ggc 

crl v 


ggc 
gly 


gee 
ala 


ggc 
gly 


ggc 
gly 


gec 
ala 


ggc 

gly 


gga 

gly 


gac age 
asp ser 
571/191 


agt 
ser 


ttc 
phe 


get 
ala 


aat 
asn 


ggc 
gly 


gcg 
ala 


gee 
ala 


ggc 
gly 


ggc gcg 
gly ala 
601/201 


ggc 

crlv 
y -*-y 


ggt 
gly 


gec 


gga 
gly 


ggg 
gly 


cac 
his 


etc 
leu 


ttc 
phe 


ggc aat 
gly asn 
631/211 


ggc 
gly 


ggg 
gly 


tec 
ser 


ggc 
gly 


ggc 
gly 


cac 
his 


ggc 
gly 


gga 
gly 


gec gtc 
ala val 
661/221 


acg 
thr 


gee 
ala 


ggc 
gly 


aac 


acc 
thr 


ggt 
gly 


ate 
ile 


ggt 
gly 


ggc gec 
gly ala 
691/231 


ggc 
gly 


ggc 
gly 


gtc 
val 


ggt 
gly 


ggg 
gly 


gac 
asp 


gec 
ala 


agg 
arg 


ctg ate 
leu ile 
721/241 


ggc 

crl v 


cac 
his 


ggt 

y j.y 


ggc 

crl v 
y - L .Y 


gee 
ala 


ggc 
gly 


ggt 

gly 


gec 
ala 


ggc ggg 
gly gly 
751/251 


gac 
asp 


cgc 
arg 


gec 
ala 


gga 
gly 


gee 
ala 


ttg 
leu 


gtt 

val 


ggc 
gly 


cgt gac 
arg asp 
781/261 


ggc 

crlv 

y JL 


ggg 

crlv 
y - L .y 


ccc 
pro 


ggt 

crlv 
y y 


ggg 
gly 


aac 
asn 


ggg 
gly 


ggc 
gly 


get ggc 
ala gly 
811/271 


ggc 
gly 


cag 
gin 


eta 
leu 


tac 
tyr 


ggc 
gly 


aac 
asn 


ggc 

gly 


ggc 
gly 


gac ggc 
asp gly 
841/281 


gec 
ala 


ccc 
pro 


ggc 

alv 


acc 
thr 


ggc 
gly 


gga 
gly 


aca 
thr 


ctg 
leu 


cag gcg 
gin ala 
871/291 


gcg 
ala 


gtg 
val 


age 
ser 


gga 
gly 


ttg 
leu 


gtg 
val 


acg 
thr 


get 
ala 


ttg ttc 
leu phe 
901/301 


ggt 
gly 


gca 
ala 


CCC 

pro 


ggc 
gly 


caa 
gin 


ccc 
pro 


ggc 
gly 


gac 
asp 


acc ggc 
thr gly 
931/311 


caa ccc 
gin pro 
SEQ ID 


ggc tag ccc 
gly ) AMB (pro 
NO; 903' 


cga 
arg 


tea 
ser 


acg 
thr 


agg gtt 
arg val 
961/321 


teg 
ser 


gtg 
val 


ccg 
pro 


gtc 
val 


egg 
arg 


ggc 
gly 


atg 
met 


gee 
ala 


ate cgc tga get ggc gat ctg gac 
ile arg)OPA(ala gly asp leu asp 
991/331 t — SEQ ID NO: 904 


tac 
tyr 


gtt 

val 


ggt gta gaa 
gly val glu 
1021/341 


aaa 
lys 


tec 
ser 


tgc 
cys 
SEQ 


cgc ccg gac 
arg pro asp 
ID NO: 905 — 


cct taa ggc tgg gac aat ttc tga 
pro)OCH_(gly trp asp asn phe)OPA 
■ ^T5Tl/351 SEQ ID NO 906 


tag eta 
AMB (leu 


ccc 
pro 


cga cac agg 
arg his arg 
1081/361 


agg 
arg 


tta 
leu 


egg 
arg 


gat 
asp 


gag 

glu 


caa 
gin 


ttc 
phe 


gcg ccg ccg 
ala pro pro 
1111/371 


etc 
leu 


act 
thr 


cag 
gin 


gtg 
val 


gtc 
val 


atg 
met 


gtt 

val 


get gag cgt 
ala glu arg 
1141/381 


get 
ala 


ggc 
gly 


tgc 
cys 


cgt 
arg 


egg 
arg 


get 
ala 


ggg 
gly 


cct ggc 
pro gly 


cac 
his 


ggc 
gly 


gec 
ala 


ggc 

gly 


cca 
pro 


ggc 
gly 


ggc 
gly 


ccc 
pro 


gee ggc 
ala gly 


ctt 
leu 


gtc 
val 


gca 
ala 


gga 
gly 


ccg 
pro 


gtt 
val) 























SEQ ID NOSs 901-906 

FIG.50C 



Coding sequence Rv0125 predicted by Cole et al . , 1998 (Nature 393:537-544) 
and containing seqSOA: 



1A1_^SEQ ID NO: 


907 










atg age 


aat 


teg 


cgc 


cgc 


cere 
a 


tea 


etc 


acrcr 
a yy 


(Met ser 


asn 


ser 


arg 


arg 


arg 


ser 


leu 


arg 


6T/21 


SEQ 


ID NO: 908 










crtc crcrcr 
y yyy 


c tg 


crcrc 


c tg 


gee 


acg 


crccr 


ccg 


gee 


val crlv 


1 eu 


cr lv 


leu 


ala 


thr 


ala 


pro 


ala 


121/41 


















ccrcr ttc 


gec 


gac 


ttc 


ccc 


crccr 
3 ^-y 


ctg 


ccc 


etc 


arg phe 


ala 


asp 


phe 


pro 


ala 


leu 


pro 


leu 


181/61 


















cca cag 


crtcr 

3 ^-3 


gtc 


aac 


ate 


aac 


acc 


aaa 


ctg 


"T>r~o crln 


val 


val 


asn 


ile 


asn 


thr 


lvs 


leu 


241/81 


















crcrc ate 


gtc 


ate 


era t 


ccc 


aac 


OCT t 
33^* 


gtc 


crtcr 


gly ile 


va 1 


ile 


asp 


pro 


asn 


olv 


val 


val 


301/101 

V-J / -1_ W 


















acr crac 


ate 


aat 


erect 


ttc 


age 


gtc 


crcrc 
y y v - 


tec 


L1IX CI O 


ile 


asn 


ala 


phe 


ser 


val 


crlv 


ser 


3 61/121 




















cgc 


acc 


cag 


gat 


crtc 


crccr 


crtcr 
y *-y 


ctg 


tyr asp 


arg 


thr 


gin 


asp 


val 


ala 


val 


leu 


421/ 141 


















fTCfT fTCfT 


ate 


OCT t 

yy L 


y y 


crcrc 
yy u 


gtc 


crccr 
y *-y 


att 


crcr t 
yy *- 


=) 1 o al a 


ile 


y y 


y Jr 


y - L y 


val 


ala 


val 


cr lv 

y Jr 


481/161 


















crcrcr caa 

3 3 3 3 


qctc 
33^* 


crQa 
y y tA 


acg 


ccc 


cert 


crccr 
3 3 


gtg 


cct 


gly gin 


crlv 


crlv 

3 Jr 


thr 


nro 


arg 


ala 


val 


pro 


541/181 


















cacr Qccr 


teg 


gat 


teg 


ctg 


acc 


ggt 


gee 


gaa 


gin ala 


ser 


asp 


ser 


leu 


thr 


gly 


ala 


glu 


601/201 


















gec gcg 


ate 


cag 


ccc 


ggt 


gat 


teg 


ggc 


ggg 


ala ala 


ile 


gin 


pro 


gly 


asp 


ser 


gly 


gly 


661/221 


















ggt atg 


aac 


acg 


gee 


gcg 


tec 


gat 


aac 


ttc 


g ly me t 


asn 


thr 


ala 


ala 


ser 


asp 


asn 


phe 


721/241 


















att ccg 


ate 


ggg 


cag 


gcg 


atg 


gcg 


ate 


gcg 


ile pro 


ile 


gly 


gin 


ala 


met 


ala 


ile 


ala 


781/261 


















acc gtt 


cat 


ate 


ggg 


cct 


acc 


gec 


ttc 


etc 


thr val 


his 


ile 


gly 


pro 


thr 


ala 


phe 


leu 


841/281 


















ggc gca 


cga 


gtc 


caa 


cgc 


gtg 


gtc 


ggg 


age 


gly ala 


arg 


val 


gin 


arg 


val 


val 


gly 


ser 


901/301 


















ggc gac 


gtg 


ate 


acc 


gcg 


gtc 


gac 


ggc 


get 


gly asp 


val 


ile 


thr 


ala 


val 


asp 


gly 


ala 


961/321 


















gcg ctt 


aac 


ggg 


cat 


cat 


ccc 


ggt 


gac 


gtc 


ala leu 


asn 


gly 


his 


his 


pro 


gly 


asp 


val 


1021/341 
















ggc acg 


cgt 


aca 


ggg 


aac 


gtg 


aca 


ttg 


gec 


gly thr 


arg 


thr 


gly 


asn 


val 


thr 


leu 


ala 



SEQ ID 



31/11 

tgg tea tgg ttg ctg age gtg ctg get gee 
trp ser trp leu leu ser val leu ala ala 
91/31 

cag gcg gec ccg ccg gee ttg teg cag gac 
gin ala ala pro pro ala leu ser gin asp 
151/51 

gac ccg tec gcg atg gtc gec caa gtg ggg 
asp pro ser ala met val ala gin val gly 
211/71 

ggc tac aac aac gee gtg ggc gec ggg acc 
gly tyr asn asn ala val gly ala gly thr 
271/91 

ctg acc aac aac cac gtg ate gcg ggc gee 
leu thr asn asn his val ile ala gly ala 
331/111 

ggc caa acc tac ggc gtc gat gtg gtc ggg 
gly gin. thr tyr gly val asp val val gly 
391/131 

cag ctg cgc ggt gec ggt ggc ctg ccg teg 
gin leu arg gly ala gly gly leu pro ser 
451/151 

gag ccc gtc gtc gcg atg ggc aac age ggt 
glu pro val val ala met gly asn ser gly 
511/171 

ggc agg gtg gtc gcg etc ggc caa acc gtg 
gly arg val val aia leu gly gin thr val 
571/191 

gag aca ttg aac ggg ttg ate cag ttc gat 
glu thr leu asn gly leu ile gin phe asp 
631/211 

ccc gtc gtc aac ggc eta gga cag gtg gtc 
pro val val asn gly leu gly gin val val 
691/231 

cag ctg tec cag ggt ggg cag gga ttc gec 
gin leu ser gin gly gly gin gly phe ala 
751/251 

ggc cag ate cga teg ggt ggg ggg tea ccc 
gly gin ile arg ser gly gly gly ser pro 
811/271 

ggc ttg ggt gtt gtc gac aac aac ggc aac 
gly leu gly val val asp asn asn gly asn 
871/291 

get ccg gcg gca agt etc ggc ate tec acc 
ala pro ala ala ser leu gly ile ser thr 
931/311 

ccg ate aac teg gee acc gcg atg gcg gac 
pro ile asn ser ala thr ala met ala asp 
991/331 

ate teg gtg acc tgg caa acc aag teg ggc 
ile ser val thr trp gin thr lys ser gly 
1051/351 

gag gga ccc ccg gec tga 
glu gly pro pro ala)OPA 

:>Ss907-908 



FIG.50D 



2 6%$ 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv0125: 



1 /1 ^ SEQ ID NO: 909 31/11 

tag aaa aat cct gcc gcc egg acc ctt aag get ggg aca att 

AMB ( lys asn pro ala ala arg thr leu lys ala giy thr ile 

61/2? SEQ ID NO: 910 91/31 

aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc 

thr giy giy tyr giy met ser asn ser arg arg arg ser leu 

121/41 151/51 

age gtg ctg get gcc gtc ggg ctg ggc ctg gcc acg gcg ccg 

ser val leu ala ala val giy leu giy leu ala thr ala pro 

181/61 211/71 

gcc ttg teg cag gac egg ttc gcc gac ttc ccc gcg ctg ccc 

ala leu ser gin asp arg phe ala asp phe pro ala leu pro 

241/81 271/91 

gtc gcc caa gtg ggg cca cag gtg gtc aac ate aac acc aaa 

val ala gin val giy pro gin val val asn ile asn thr lys 

301/101 331/111 

gtg ggc gcc ggg acc ggc ate gtc ate gat ccc aac ggt gtc 

val giy ala giy thr giy ile val ile asp pro asn giy val 

361/121 391/131 

gtg ate gcg ggc gcc acc gac ate aat gcg ttc age gtc ggc 

val ile ala giy ala thr asp ile asn ala phe ser val giy 

421/141 451/151 ■ 

gtc gat gtg gtc ggg tat gac cgc acc cag gat gtc gcg gtg 

val asp val val giy tyr asp arg thr gin asp val ala val 

481/161 511/171 

ggt ggc ctg ccg teg gcg gcg ate ggt ggc ggc gtc gcg gtt 

giy giy leu pro ser ala ala ile giy giy giy val ala val 

541/181 571/191 

atg ggc aac age ggt ggg cag ggc gga acg ccc cgt gcg gtg 

met giy asn ser giy giy gin giy giy thr pro arg ala val 

601/201 631/211 

etc ggc caa acc gtg cag gcg teg gat teg ctg acc ggt gcc 

leu giy gin thr val gin ala ser asp ser leu thr giy ala 

661/221 691/231 

ttg ate cag ttc gat gcc gcg ate cag ccc ggt gat teg ggc 

leu ile gin phe asp ala ala ile gin pro giy asp ser giy 

721/241 751/251 

eta gga cag gtg gtc ggt atg aac acg gcc gcg tec gat aac 

leu giy gin val val giy met asn thr ala ala ser asp asn 

781/261 811/271 

ggg cag gga ttc gcc att ccg ate ggg cag gcg atg gcg ate 

giy gin giy phe ala ile pro ile giy gin ala met ala ile 

841/281 871/291 

ggt ggg ggg tea ccc acc gtt cat ate ggg cct acc gcc ttc 

giy giy giy ser pro thr val his ile giy pro thr ala phe 

901/301 931/311 

gac aac aac ggc aac ggc gca cga gtc caa cgc gtg gtc ggg 

asp asn asn giy asn giy ala arg val gin arg val val giy 

961/321 991/331 

etc ggc ate tec acc ggc gac gtg ate acc gcg gtc gac ggc 

leu giy ile ser thr giy asp val ile thr ala val asp giy 

1021/341 1051/351 

acc gcg atg gcg gac gcg ctt aac ggg cat cat ccc ggt gac 

thr ala met ala asp ala leu asn giy his his pro giy asp 

1081/361 1111/371 

caa acc aag teg ggc ggc acg cgt aca ggg aac gtg aca ttg 

gin thr lys ser giy giy thr arg thr giy asn val thr leu 

1141/381 
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SEQ ID NO. 922 CGCTCTGGGGCTCAAA tAM CTGCAG/^ lAGGAGA^ GATCCAAATG 
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Experiment of molecular hybridization of a specific to DP428 on 
the genomic DNA of various mycobacterial species 



1 2 3 4 5 6 7 8 9 10 11 12 13 




1: AT. tuberculosis 2: AT. bovis 3: BCG 4: M. africanum 5: cancelled 6: M. 
fortuitum 7: M. simiae 8: M. avium 9: AT. chelonae 10: M. flavescens 11: AT. 
cfordonae 12 : M. marinum 13 : W. Jcansasii 
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